Kernel Normalized Convolutional Networks

Reza Nasirigerdeh; Reihaneh Torkzadehmahani; Daniel Rueckert; Georgios Kaissis

カーネル正規化畳み込みネットワーク

既存の深層畳み込みニューラルネットワーク (CNN) アーキテクチャは、モデルを効果的にトレーニングするためにバッチ正規化 (BatchNorm) に依存することがよくあります。 BatchNorm は集中型トレーニングでモデルのパフォーマンスを大幅に向上させますが、フェデレーションラーニングと差分プライバシー設定には適していません。集中型学習でも、バッチサイズが小さいと BatchNorm のパフォーマンスは低下します。これらの制限に対処するために、カーネル正規化層とカーネル正規化畳み込み層を提案し、それらを主要なビルディングブロックとしてカーネル正規化畳み込みネットワーク (KNConvNets) に組み込みます。 BatchNorm レイヤーを省略しながら、VGGNets や ResNets などの最先端の CNN に対応する KNConvNets を実装します。大規模な実験を通じて、KNConvNets は、集中型、連合型、および差別化されたプライベートな学習設定での精度と収束率の両方の点で、バッチ、グループ、およびレイヤーの正規化された対応物よりも一貫して優れていることを示しています。

Existing deep convolutional neural network (CNN) architectures frequently rely upon batch normalization (BatchNorm) to effectively train the model. BatchNorm significantly improves model performance in centralized training, but it is unsuitable for federated learning and differential privacy settings. Even in centralized learning, BatchNorm performs poorly with smaller batch sizes. To address these limitations, we propose kernel normalization and kernel normalized convolutional layers, and incorporate them into kernel normalized convolutional networks (KNConvNets) as the main building blocks. We implement KNConvNets corresponding to the state-of-the-art CNNs such as VGGNets and ResNets while forgoing BatchNorm layers. Through extensive experiments, we illustrate KNConvNets consistently outperform their batch, group, and layer normalized counterparts in terms of both accuracy and convergence rate in centralized, federated, and differentially private learning settings.

updated: Fri Sep 30 2022 19:14:19 GMT+0000 (UTC)

published: Fri May 20 2022 11:18:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト