T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit

Nikhil P Ghanathe; Steve Wilton

T-RECX: アーリー eXit を備えた小さなリソース効率の高い畳み込みニューラルネットワーク

機械学習 (ML) をミリワット規模のエッジデバイス (tinyML) に展開することは、ML とモノのインターネット (IoT) における最近のブレークスルーにより人気が高まっています。 tinyML の研究のほとんどは、精度 (およびモデル容量) と引き換えにコンパクトなモデルを KB サイズの小さなエッジデバイスに適合させるモデル圧縮技術に焦点を当てています。この論文では、早期終了中間分類子を追加することで、そのようなモデルをどのように強化できるかを示します。中間分類器がその予測に十分な信頼性を示した場合、ネットワークはそれによって早期に終了し、時間を大幅に節約できます。以前の研究では早期終了分類器が提案されていましたが、これらの以前の提案は大規模なネットワークに焦点を当てており、tinyML アプリケーションにとって最適ではない/非実用的な手法になっています。私たちの手法は、特に小さな CNN サイズのモデル用に最適化されています。さらに、早期終了によって学習された表現を活用することにより、ネットワークの考えすぎの影響を軽減する方法を提示します。画像分類、キーワードスポッティング、ビジュアルウェイクワード検出タスクについて、MLPerf の小さなベンチマークスイートから 3 つの CNN で T-RecX を評価します。私たちの結果は、T-RecX が 1) ベースラインネットワークの精度を向上させ、2) 評価されたすべてのモデルで 1% の精度と引き換えに、FLOPS で平均 31.58% の削減を達成することを示しています。さらに、私たちの方法は、評価する小さな CNN で人気のある以前の研究よりも一貫して優れていることを示しています。

Deploying Machine learning (ML) on milliwatt-scale edge devices (tinyML) is gaining popularity due to recent breakthroughs in ML and Internet of Things (IoT). Most tinyML research focuses on model compression techniques that trade accuracy (and model capacity) for compact models to fit into the KB-sized tiny-edge devices. In this paper, we show how such models can be enhanced by the addition of an early exit intermediate classifier. If the intermediate classifier exhibits sufficient confidence in its prediction, the network exits early thereby, resulting in considerable savings in time. Although early exit classifiers have been proposed in previous work, these previous proposals focus on large networks, making their techniques suboptimal/impractical for tinyML applications. Our technique is optimized specifically for tiny-CNN sized models. In addition, we present a method to alleviate the effect of network overthinking by leveraging the representations learned by the early exit. We evaluate T-RecX on three CNNs from the MLPerf tiny benchmark suite for image classification, keyword spotting and visual wake word detection tasks. Our results show that T-RecX 1) improves the accuracy of baseline network, 2) achieves 31.58% average reduction in FLOPS in exchange for one percent accuracy across all evaluated models. Furthermore, we show that our methods consistently outperform popular prior works on the tiny-CNNs we evaluate.

updated: Wed Apr 26 2023 23:09:57 GMT+0000 (UTC)

published: Thu Jul 14 2022 02:05:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト