RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs

André Santos; João Dinis Ferreira; Onur Mutlu; Gabriel Falcao

RedBit: 量子化された CNN の精度を評価するためのエンドツーエンドの柔軟なフレームワーク

近年、畳み込みニューラルネットワーク (CNN) は、画像処理、分類、セグメンテーションタスク用のディープニューラルネットワークの標準クラスになりました。ただし、CNN によって得られる精度の大きな進歩は、ネットワークトポロジの複雑さが増したことに由来しており、CNN のトレーニングと推論においてかなりのパフォーマンスとエネルギーのペナルティが発生します。最近の多くの研究では、ネットワークのパラメーターのビット幅を減らすことで構成されるパラメーター量子化の有効性が検証されており、精度を大幅に損なうことなく、パフォーマンスとエネルギー効率を大幅に向上させることができます。ただし、異なる量子化方法の相対的な有効性を比較することは困難です。この問題に対処するために、ネットワーク精度に対するさまざまなアルゴリズムとパラメーター構成の有効性を評価するための透過的で拡張可能で使いやすいインターフェイスを提供するオープンソースフレームワークである RedBit を導入します。 RedBit を使用して、MNIST、CIFAR-10、および ImageNet データセットに適用される 5 つの最先端の量子化方法の包括的な調査を実行します。合計 2300 の個々のビット幅の組み合わせを評価し、ネットワークの重みと入力アクティベーションパラメーターの幅を 32 ビットから 1 ビット (たとえば、8/8、2/2、1/32、1/1、ウェイト/アクティベーション用)。このホワイトペーパーのすべての結果を生成するために、最先端の GPU のプールで 20000 時間以上の計算時間が使用されました。 1 ビット量子化の場合、MNIST、CIFAR-10、および ImageNet データセットの精度損失は、それぞれ [0.26%, 0.79%]、[9.74%, 32.96%]、および [10.86%, 47.36%] の上位 1 の範囲です。ソースコードをダウンロードして RedBit で実験し、独自の観察結果を公開リポジトリ (https://github.com/IT-Coimbra/RedBit で入手可能) に送信することを積極的に推奨します。

In recent years, Convolutional Neural Networks (CNNs) have become the standard class of deep neural network for image processing, classification and segmentation tasks. However, the large strides in accuracy obtained by CNNs have been derived from increasing the complexity of network topologies, which incurs sizeable performance and energy penalties in the training and inference of CNNs. Many recent works have validated the effectiveness of parameter quantization, which consists in reducing the bit width of the network's parameters, to enable the attainment of considerable performance and energy efficiency gains without significantly compromising accuracy. However, it is difficult to compare the relative effectiveness of different quantization methods. To address this problem, we introduce RedBit, an open-source framework that provides a transparent, extensible and easy-to-use interface to evaluate the effectiveness of different algorithms and parameter configurations on network accuracy. We use RedBit to perform a comprehensive survey of five state-of-the-art quantization methods applied to the MNIST, CIFAR-10 and ImageNet datasets. We evaluate a total of 2300 individual bit width combinations, independently tuning the width of the network's weight and input activation parameters, from 32 bits down to 1 bit (e.g., 8/8, 2/2, 1/32, 1/1, for weights/activations). Upwards of 20000 hours of computing time in a pool of state-of-the-art GPUs were used to generate all the results in this paper. For 1-bit quantization, the accuracy losses for the MNIST, CIFAR-10 and ImageNet datasets range between [0.26%, 0.79%], [9.74%, 32.96%] and [10.86%, 47.36%] top-1, respectively. We actively encourage the reader to download the source code and experiment with RedBit, and to submit their own observed results to our public repository, available at https://github.com/IT-Coimbra/RedBit.

updated: Sun Jan 15 2023 21:27:35 GMT+0000 (UTC)

published: Sun Jan 15 2023 21:27:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト