Filter Pruning based on Information Capacity and Independence

Xiaolong Tang; Tianheng Hu; Yufeng Shi

Filter pruning has been widely used in the compression and acceleration of convolutional neural networks (CNNs). However, most existing methods still suffer from heavy compute cost and biased filter selection. Moreover, most designs for filter evaluation lack interpretability because they have no appropriate theoretical guidance. In this paper, we propose a novel filter pruning method that evaluates filters in an interpretable, multi-perspective, and data-free manner. We introduce information capacity, a metric that represents the amount of information contained in a filter. Based on the interpretability and validity of information entropy, we propose to use it as a quantitative index of information quantity. In addition, we experimentally show an obvious correlation between the entropy of the feature map and the corresponding filter, which lets us propose an interpretable, data-driven scheme to measure the information capacity of the filter. Further, we introduce information independence, another metric that represents the correlation among different filters. Consequently, the least important filters, which have less information capacity and less information independence, are pruned. We evaluate our method on two benchmarks using multiple representative CNN architectures, including VGG-16 and ResNet. On CIFAR-10, we reduce 71.9% of floating-point operations (FLOPs) and 69.4% of parameters for ResNet-110 with a 0.28% accuracy increase. On ILSVRC-2012, we reduce 76.6% of FLOPs and 68.6% of parameters for ResNet-50 with only a 2.80% accuracy decrease, which outperforms the state of the art.
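
The abstract does not give the formulas behind the two metrics, so the following is only a rough sketch of the general idea, not the authors' implementation. It assumes that information capacity can be approximated by the Shannon entropy of a histogram over a filter's weights, that information independence can be approximated by one minus the mean absolute cosine similarity to the other filters in the same layer, and that the two scores are combined by adding their min-max normalized values. The function names, the `bins` parameter, the `prune_ratio` parameter, and the combination rule are all hypothetical.

```python
import numpy as np


def filter_entropy(weights, bins=16):
    """Shannon entropy (in bits) of a histogram over one filter's weights.

    Assumption: a histogram estimate stands in for "information capacity".
    """
    counts, _ = np.histogram(np.asarray(weights).ravel(), bins=bins)
    p = counts[counts > 0] / counts.sum()
    return float(-(p * np.log2(p)).sum())


def filter_independence(filters):
    """1 - mean |cosine similarity| between each filter and the others.

    Assumption: cosine similarity stands in for the correlation among
    filters; the source does not say which measure is used.
    """
    n = filters.shape[0]
    flat = filters.reshape(n, -1)
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.maximum(norms, 1e-12)  # guard against all-zero filters
    sim = np.abs(unit @ unit.T)
    np.fill_diagonal(sim, 0.0)  # ignore self-similarity
    return 1.0 - sim.sum(axis=1) / (n - 1)


def _min_max(x):
    """Rescale a score vector to [0, 1]; constant vectors map to zeros."""
    span = x.max() - x.min()
    return (x - x.min()) / span if span > 0 else np.zeros_like(x)


def select_filters_to_prune(filters, prune_ratio=0.5, bins=16):
    """Return indices of the lowest-scoring filters in one conv layer.

    `filters` has shape (out_channels, in_channels, kH, kW).
    The additive combination of the two scores is a hypothetical choice.
    """
    capacity = np.array([filter_entropy(f, bins) for f in filters])
    independence = filter_independence(filters)
    score = _min_max(capacity) + _min_max(independence)
    n_prune = int(filters.shape[0] * prune_ratio)
    return np.argsort(score)[:n_prune]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    layer = rng.normal(size=(8, 3, 3, 3))  # toy layer with 8 filters
    print(select_filters_to_prune(layer, prune_ratio=0.25))
```

Both scores here are computed from the weights alone, which matches the data-free evaluation the abstract describes; a feature-map-based entropy would instead need input samples.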

updated: Tue Mar 07 2023 04:26:44 GMT+0000 (UTC)

published: Tue Mar 07 2023 04:26:44 GMT+0000 (UTC)

arXiv
