Neural Network Pruning with Residual-Connections and Limited-Data

Jian-Hao Luo; Jianxin Wu

Filter-level pruning is an effective way to accelerate inference in deep CNN models. Although numerous pruning algorithms have been proposed, two issues remain open. The first is how to prune residual connections. We propose pruning channels both inside and outside the residual connections using a KL-divergence-based criterion. The second is pruning with limited data. We observe an interesting phenomenon: pruning directly on a small dataset usually gives worse results than fine-tuning a small model that was pruned or trained from scratch on the large dataset. Knowledge distillation is an effective way to compensate for limited data, but the logits of a teacher model may be noisy. To avoid the influence of this label noise, we propose a label refinement approach. Experiments demonstrate the effectiveness of our method (CURL, Compression Using Residual-connections and Limited-data). CURL significantly outperforms previous state-of-the-art methods on ImageNet. More importantly, when pruning on small datasets, CURL achieves comparable or much better performance than fine-tuning a pretrained small model.
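The abstract names a KL-divergence based criterion without giving its form. One plausible reading, sketched below purely as an illustration, is to score each channel by how much the output distribution changes when that channel is erased. The toy two-layer network, the function names (`channel_scores`, `kl_divergence`), and the random data are all assumptions made for this sketch and are not taken from the paper.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def kl_divergence(p, q, eps=1e-12):
    # Mean KL(p || q) over the batch; eps guards against log(0).
    return float(np.mean(np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=1)))

def channel_scores(x, w1, w2):
    """Score each hidden channel by the KL divergence its removal causes.

    Hypothetical toy model: a linear layer with ReLU stands in for a
    convolutional layer, followed by a linear classifier.
    """
    hidden = np.maximum(x @ w1, 0.0)
    p_full = softmax(hidden @ w2)         # output of the unpruned model
    scores = []
    for c in range(hidden.shape[1]):
        masked = hidden.copy()
        masked[:, c] = 0.0                # erase one channel
        scores.append(kl_divergence(p_full, softmax(masked @ w2)))
    return np.array(scores)

rng = np.random.default_rng(0)
x = rng.normal(size=(32, 8))              # 32 toy samples
w1 = rng.normal(size=(8, 6))              # 6 hidden channels
w2 = rng.normal(size=(6, 4))              # 4 output classes
w2[3, :] = 0.0                            # channel 3 cannot affect the output

scores = channel_scores(x, w1, w2)
prune_order = np.argsort(scores)          # lowest-scoring channels first
```

Under this reading, channels with the smallest scores are the first candidates for removal, since erasing them barely changes the predictions. A score computed on the network output rather than on a layer-local statistic can be applied in the same way to channels inside and outside a residual connection, which may be why such a criterion suits the setting the abstract describes.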

updated: Sat Apr 25 2020 08:02:47 GMT+0000 (UTC)

published: Tue Nov 19 2019 06:43:34 GMT+0000 (UTC)

arXiv

