Can Deep Neural Networks be Converted to Ultra Low-Latency Spiking Neural Networks?

Gourav Datta; Peter A. Beerel

ディープニューラルネットワークを超低遅延スパイキングニューラルネットワークに変換できますか？

時間の経過とともに分散されるバイナリスパイクを介して動作するスパイキングニューラルネットワーク（SNN）は、リソースに制約のあるデバイスの有望なエネルギー効率の高いMLパラダイムとして浮上しています。ただし、現在の最先端（SOTA）SNNは、許容可能な推論精度のために複数のタイムステップを必要とし、スパイクアクティビティを増加させ、その結果、エネルギー消費を増加させます。 SNNのSOTAトレーニング戦略には、スパイクのないディープニューラルネットワーク（DNN）からの変換が含まれます。このホワイトペーパーでは、SOTA変換戦略では、DNNとSNNの事前アクティブ化値が均一に分散されていると誤って想定しているため、超低遅延を実現できないと判断しました。 DNNと変換されたSNNの間のエラーを最小限に抑えて、これらの分布を正確にキャプチャする新しいトレーニングアルゴリズムを提案します。結果として得られるSNNは、レイテンシが非常に低く、アクティベーションのスパース性が高いため、計算効率が大幅に向上します。特に、いくつかのVGGおよびResNetアーキテクチャ上のCIFAR-10およびCIFAR-100データセットからの画像認識タスクに関するフレームワークを評価します。 CIFAR-100データセットでわずか2タイムステップで64.19％のトップ1精度が得られ、アイソアーキテクチャ標準DNNと比較して計算エネルギーが約159.2倍低くなっています。他のSOTASNNモデルと比較して、私たちのモデルは推論を2.5〜8倍高速に実行します（つまり、タイムステップが少なくなります）。

Spiking neural networks (SNNs), that operate via binary spikes distributed over time, have emerged as a promising energy efficient ML paradigm for resource-constrained devices. However, the current state-of-the-art (SOTA) SNNs require multiple time steps for acceptable inference accuracy, increasing spiking activity and, consequently, energy consumption. SOTA training strategies for SNNs involve conversion from a non-spiking deep neural network (DNN). In this paper, we determine that SOTA conversion strategies cannot yield ultra low latency because they incorrectly assume that the DNN and SNN pre-activation values are uniformly distributed. We propose a new training algorithm that accurately captures these distributions, minimizing the error between the DNN and converted SNN. The resulting SNNs have ultra low latency and high activation sparsity, yielding significant improvements in compute efficiency. In particular, we evaluate our framework on image recognition tasks from CIFAR-10 and CIFAR-100 datasets on several VGG and ResNet architectures. We obtain top-1 accuracy of 64.19% with only 2 time steps on the CIFAR-100 dataset with ~159.2x lower compute energy compared to an iso-architecture standard DNN. Compared to other SOTA SNN models, our models perform inference 2.5-8x faster (i.e., with fewer time steps).

updated: Wed Dec 22 2021 18:47:45 GMT+0000 (UTC)

published: Wed Dec 22 2021 18:47:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト