Pruning Very Deep Neural Network Channels for Efficient Inference

Yihui He

In this paper, we introduce a new channel pruning method to accelerate very deep convolutional neural networks. Given a trained CNN model, we propose an iterative two-step algorithm that prunes each layer effectively: channel selection based on LASSO regression, followed by least-squares reconstruction. We further generalize this algorithm to multi-layer and multi-branch cases. Our method reduces the accumulated error and improves compatibility with various architectures. Our pruned VGG-16 achieves state-of-the-art results, with a 5x speed-up and only a 0.3% increase in error. More importantly, our method can accelerate modern networks such as ResNet and Xception, incurring only 1.4% and 1.0% accuracy loss, respectively, under a 2x speed-up. Our code has been made publicly available.
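The two steps named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' released code, and it makes several simplifying assumptions: the layer is treated as a 1x1 convolution (so each input channel is a single feature column), the LASSO problem is solved with a plain coordinate-descent loop, and the penalty is raised geometrically until at most `n_keep` channels survive. The names `lasso_cd`, `prune_layer`, `n_keep` and `growth` are hypothetical and chosen only for this example.

```python
import numpy as np

def lasso_cd(A, y, lam, n_iter=200):
    """Minimize 0.5*||y - A b||^2 + lam*||b||_1 by cyclic coordinate descent."""
    b = np.zeros(A.shape[1])
    col_sq = (A ** 2).sum(axis=0)
    r = y.astype(float).copy()              # residual y - A b (b starts at zero)
    for _ in range(n_iter):
        for j in range(A.shape[1]):
            if col_sq[j] == 0.0:
                continue
            r += A[:, j] * b[j]             # take column j out of the fit
            rho = A[:, j] @ r
            # soft-thresholding update for coefficient j
            b[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            r -= A[:, j] * b[j]             # put the updated column back
    return b

def prune_layer(X, W, n_keep, growth=1.2):
    """One pass of both steps for a layer Y = X @ W.

    X: (N, c) sampled inputs, W: (c, n) trained weights (1x1 conv assumption).
    """
    Y = X @ W
    c = X.shape[1]
    # Column j of A is the flattened contribution of input channel j to Y,
    # so A @ ones(c) equals Y.reshape(-1).
    A = (X[:, :, None] * W[None, :, :]).transpose(1, 0, 2).reshape(c, -1).T
    y = Y.reshape(-1)

    # Step 1: LASSO channel selection. Start with a small penalty and raise
    # it until at most n_keep coefficients are non-zero (assumed schedule).
    lam = 0.01 * np.abs(A.T @ y).max()
    while True:
        beta = lasso_cd(A, y, lam)
        keep = np.flatnonzero(beta)
        if keep.size <= n_keep:
            break
        lam *= growth

    # Step 2: least-squares reconstruction of Y from the kept channels only.
    W_new, *_ = np.linalg.lstsq(X[:, keep], Y, rcond=None)
    return keep, W_new

# Toy layer: channels 1, 3 and 5 carry almost no weight.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))
W = rng.normal(size=(6, 8))
W[[1, 3, 5]] *= 1e-3
keep, W_new = prune_layer(X, W, n_keep=3)
rel_err = np.linalg.norm(X @ W - X[:, keep] @ W_new) / np.linalg.norm(X @ W)
print(keep, rel_err)
```

In this toy setup the selection step should discard the three near-zero channels, and the refit weights should reproduce the original output closely. A real convolutional layer would need one block of columns per channel (one per kernel position) rather than a single column.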

updated: Mon Nov 14 2022 06:48:33 GMT+0000 (UTC)

published: Mon Nov 14 2022 06:48:33 GMT+0000 (UTC)

arXiv
