FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations

Yichi Zhang; Junhao Pan; Xinheng Liu; Hongzheng Chen; Deming Chen; Zhiru Zhang

FracBNN：フラクショナルアクティベーションを備えた正確でFPGA効率の高いバイナリニューラルネットワーク

バイナリニューラルネットワーク（BNN）には、1ビットの重みとアクティベーションがあります。このようなネットワークは、主要な計算がビット単位の演算であり、メモリ要件も大幅に削減されるため、FPGAに最適です。ただし、最先端のコンパクト畳み込みニューラルネットワーク（CNN）モデルと比較すると、BNNはImageNetなどの現実的なデータセットではるかに低い精度を生成する傾向があります。さらに、BNNの入力層は、大きな精度の低下を回避するために従来は2値化から除外されていたため、徐々に主要な計算のボトルネックになりました。この作業は、BNNの精度を大幅に向上させるためにフラクショナルアクティベーションを活用するFracBNNを提案します。具体的には、私たちのアプローチは、追加のスパースバイナリ畳み込みを使用して、最大2ビットの特徴を計算するために二重精度のアクティブ化スキームを採用しています。さらに、新しい温度計エンコーディングを使用して入力層を2値化します。全体として、FracBNNは、すべての畳み込み層が純粋なバイナリMAC演算（BMAC）で計算される従来のBNNの主な利点を保持しています。フラクショナルアクティベーションをサポートする新しいBNNモデル用の効率的なFPGAベースのアクセラレータを設計します。リソースに制約のあるシナリオでFracBNNのパフォーマンスを評価するために、最適化されたネットワークアーキテクチャ全体を組み込みFPGA（ザイリンクスUltra96v2）に実装します。 ImageNetでの実験では、FracBNNがMobileNetV2に匹敵する精度を達成し、FPGAで最もよく知られているBNN設計を上回り、トップ1の精度が28.9％向上し、モデルサイズが2.5分の1に削減されました。 FracBNNは、同じモデルサイズを使用しながら、最近導入されたBNNモデルよりもパフォーマンスが高く、トップ1の精度が2.4％向上しています。組み込みFPGAデバイスでは、FracBNNはリアルタイム画像分類の機能を示します。

Binary neural networks (BNNs) have 1-bit weights and activations. Such networks are well suited for FPGAs, as their dominant computations are bitwise arithmetic and the memory requirement is also significantly reduced. However, compared to start-of-the-art compact convolutional neural network (CNN) models, BNNs tend to produce a much lower accuracy on realistic datasets such as ImageNet. In addition, the input layer of BNNs has gradually become a major compute bottleneck, because it is conventionally excluded from binarization to avoid a large accuracy loss. This work proposes FracBNN, which exploits fractional activations to substantially improve the accuracy of BNNs. Specifically, our approach employs a dual-precision activation scheme to compute features with up to two bits, using an additional sparse binary convolution. We further binarize the input layer using a novel thermometer encoding. Overall, FracBNN preserves the key benefits of conventional BNNs, where all convolutional layers are computed in pure binary MAC operations (BMACs). We design an efficient FPGA-based accelerator for our novel BNN model that supports the fractional activations. To evaluate the performance of FracBNN under a resource-constrained scenario, we implement the entire optimized network architecture on an embedded FPGA (Xilinx Ultra96v2). Our experiments on ImageNet show that FracBNN achieves an accuracy comparable to MobileNetV2, surpassing the best-known BNN design on FPGAs with an increase of 28.9% in top-1 accuracy and a 2.5x reduction in model size. FracBNN also outperforms a recently introduced BNN model with an increase of 2.4% in top-1 accuracy while using the same model size. On the embedded FPGA device, FracBNN demonstrates the ability of real-time image classification.

updated: Tue Dec 22 2020 17:49:30 GMT+0000 (UTC)

published: Tue Dec 22 2020 17:49:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト