Tianlong Chen; Zhenyu Zhang; Xu Ouyang; Zechun Liu; Zhiqiang Shen; Zhangyang Wang

「BNN-BN =？」：バッチ正規化なしのバイナリニューラルネットワークのトレーニング

"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization

バッチ正規化（BN）は重要な促進要因であり、最先端のバイナリニューラルネットワーク（BNN）に不可欠であると考えられています。ただし、BN層は計算にコストがかかり、通常は非バイナリパラメータを使用して実装されるため、BNNトレーニングを効率的に実装するためのハードルが残ります。また、各バッチ内のサンプル間に望ましくない依存関係が生じます。 Batch Normalization Free（BN-Free）トレーニングの最新の進歩に触発されて、BNNのトレーニングにフレームワークを拡張し、BNNトレーニングおよび推論体制からBNを削除して完了できることを初めて実証しました。適応勾配クリッピング、スケール重量の標準化、特殊なボトルネックブロックなどの技術をプラグインしてカスタマイズすることにより、BNフリーのBNNは、BNベースの対応するものと比較して競争力のある精度を維持できます。広範な実験により、さまざまなBNNバックボーンとデータセットにわたる提案の有効性が検証されます。たとえば、最先端のReActNetからBNを削除した後でも、CIFAR-10、CIFAR-100、およびImageNetでそれぞれ92.08％、68.34％、および68.0％の精度を達成するために、提案された方法論でトレーニングすることができます。、わずかなパフォーマンスの低下があります（CIFARでは0.23％〜0.44％、ImageNetでは1.40％）。コードと事前トレーニング済みモデルは、https：//github.com/VITA-Group/BNN_NoBNで入手できます。

Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art binary neural networks (BNN). However, the BN layer is costly to calculate and is typically implemented with non-binary parameters, leaving a hurdle for the efficient implementation of BNN training. It also introduces undesirable dependence between samples within each batch. Inspired by the latest advance on Batch Normalization Free (BN-Free) training, we extend their framework to training BNNs, and for the first time demonstrate that BNs can be completed removed from BNN training and inference regimes. By plugging in and customizing techniques including adaptive gradient clipping, scale weight standardization, and specialized bottleneck block, a BN-free BNN is capable of maintaining competitive accuracy compared to its BN-based counterpart. Extensive experiments validate the effectiveness of our proposal across diverse BNN backbones and datasets. For example, after removing BNs from the state-of-the-art ReActNets, it can still be trained with our proposed methodology to achieve 92.08%, 68.34%, and 68.0% accuracy on CIFAR-10, CIFAR-100, and ImageNet respectively, with marginal performance drop (0.23%~0.44% on CIFAR and 1.40% on ImageNet). Codes and pre-trained models are available at: https://github.com/VITA-Group/BNN_NoBN.

updated: Fri Apr 16 2021 16:46:57 GMT+0000 (UTC)

published: Fri Apr 16 2021 16:46:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト