ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections

Rui-Yang Ju; Ting-Yu Lin; Jia-Hao Jian; Jen-Shiun Chiang; Wei-Bin Yang

ThreshNet: しきい値メカニズムを使用して接続を削減する効率的な DenseNet

コンピュータービジョンタスク用のニューラルネットワークの継続的な開発により、ますます多くのネットワークアーキテクチャが目覚ましい成功を収めています。最も高度なニューラルネットワークアーキテクチャの 1 つである DenseNet は、すべての機能マップをショートカットして、モデルの深さの問題を解決します。このネットワークアーキテクチャは、低いパラメーターで優れた精度を備えていますが、過度の推論時間が必要です。この問題を解決するために、HarDNet は特徴マップ間の接続を削減し、残りの接続を高調波に似せます。ただし、この圧縮方法では、モデルの精度が低下し、パラメータとモデルのサイズが増加する可能性があります。このネットワークアーキテクチャにより、メモリアクセス時間が短縮される可能性がありますが、全体的なパフォーマンスはさらに向上する可能性があります。したがって、しきい値メカニズムを使用して接続方法をさらに最適化する新しいネットワークアーキテクチャ、ThreshNet を提案します。ネットワークの推論を加速するために、さまざまな畳み込み層のさまざまな数の接続が破棄されます。提案されたネットワークは、NVIDIA RTX 3050 および Raspberry Pi 4 のプラットフォームで、CIFAR 10 および SVHN データセットを使用した画像分類で評価されています。実験結果は、HarDNet68、GhostNet、MobileNetV2、ShuffleNet、および EfficientNet と比較して、提案された ThreshNet79 は、それぞれ 5%、9%、10%、18%、20% 高速です。 ThreshNet95 のパラメーター数は、HarDNet85 のパラメーター数より 55% 少なくなっています。新しいモデルの圧縮とモデルの高速化方法により、推論時間が短縮され、ネットワークモデルがモバイルデバイスで動作できるようになります。

With the continuous development of neural networks for computer vision tasks, more and more network architectures have achieved outstanding success. As one of the most advanced neural network architectures, DenseNet shortcuts all feature maps to solve the model depth problem. Although this network architecture has excellent accuracy with low parameters, it requires an excessive inference time. To solve this problem, HarDNet reduces the connections between the feature maps, making the remaining connections resemble harmonic waves. However, this compression method may result in a decrease in the model accuracy and an increase in the parameters and model size. This network architecture may reduce the memory access time, but its overall performance can still be improved. Therefore, we propose a new network architecture, ThreshNet, using a threshold mechanism to further optimize the connection method. Different numbers of connections for different convolution layers are discarded to accelerate the inference of the network. The proposed network has been evaluated with image classification using CIFAR 10 and SVHN datasets under platforms of NVIDIA RTX 3050 and Raspberry Pi 4. The experimental results show that, compared with HarDNet68, GhostNet, MobileNetV2, ShuffleNet, and EfficientNet, the inference time of the proposed ThreshNet79 is 5%, 9%, 10%, 18%, and 20% faster, respectively. The number of parameters of ThreshNet95 is 55% less than that of HarDNet85. The new model compression and model acceleration methods can speed up the inference time, enabling network models to operate on mobile devices.

updated: Sun Aug 07 2022 12:33:45 GMT+0000 (UTC)

published: Sun Jan 09 2022 13:52:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト