Improve Convolutional Neural Network Pruning by Maximizing Filter Variety

Nathan Hubens; Matei Mancas; Bernard Gosselin; Marius Preda; Titus Zaharia

Neural network pruning is a widely used strategy for reducing model storage and computing requirements. It lowers the complexity of the network by introducing sparsity in the weights. Because taking advantage of sparse matrices is still challenging, pruning is often performed in a structured way, i.e. by removing entire convolution filters in the case of ConvNets, according to a chosen pruning criterion. Common pruning criteria, such as l1-norm or movement, usually do not consider the individual utility of filters, which may lead to: (1) the removal of filters exhibiting rare, and thus important and discriminative, behaviour, and (2) the retention of filters carrying redundant information. In this paper, we present a technique that addresses both issues and can be appended to any pruning criterion. This technique ensures that the selection criterion focuses on redundant filters while retaining the rare ones, thus maximizing the variety of the remaining filters. Experimental results on different datasets (CIFAR-10, CIFAR-100 and CALTECH-101) and different architectures (VGG-16 and ResNet-18) demonstrate that appending our filter selection technique to a pruning criterion achieves similar sparsity levels while maintaining higher performance. Moreover, we assess the quality of the sparse sub-networks found by applying the Lottery Ticket Hypothesis, and find that adding our method leads to better-performing tickets in most cases.
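To make the two failure modes concrete, the sketch below applies plain l1-norm structured pruning to a toy convolution layer and contrasts it with a redundancy-aware variant. The abstract does not say how redundancy or rarity is measured, so the `redundancy` function (maximum absolute cosine similarity to any other filter) and the way it rescales the l1 score are assumptions made purely for illustration, not the authors' technique. All function names and the toy weights are likewise hypothetical.

```python
import numpy as np


def l1_filter_scores(weight):
    """L1 norm of each filter; weight has shape (out_channels, in_channels, kh, kw)."""
    return np.abs(weight).reshape(weight.shape[0], -1).sum(axis=1)


def structured_prune_mask(scores, sparsity):
    """Boolean keep-mask that drops the lowest-scoring fraction of filters."""
    n_prune = int(round(sparsity * scores.size))
    order = np.argsort(scores, kind="stable")  # ascending: weakest filters first
    keep = np.ones(scores.size, dtype=bool)
    keep[order[:n_prune]] = False
    return keep


def redundancy(weight):
    """ASSUMPTION (not from the paper): a filter's redundancy is its highest
    absolute cosine similarity to any other filter in the same layer."""
    flat = weight.reshape(weight.shape[0], -1)
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.maximum(norms, 1e-12)
    sim = np.abs(unit @ unit.T)
    np.fill_diagonal(sim, 0.0)  # ignore self-similarity
    return sim.max(axis=1)


# Toy layer with 4 filters of shape (1, 2, 2).
weight = np.array([
    [[[1.0, 1.0], [1.0, 1.0]]],    # filter 0
    [[[1.0, 1.0], [1.0, 0.9]]],    # filter 1: near-duplicate of filter 0
    [[[0.5, -0.5], [0.0, 0.0]]],   # filter 2: small norm, unlike any other filter
    [[[0.0, 0.0], [2.0, -2.0]]],   # filter 3
])

l1 = l1_filter_scores(weight)

# Plain l1-norm criterion: prune 25% of the filters.
plain_keep = structured_prune_mask(l1, sparsity=0.25)

# Hypothetical redundancy-aware variant: shrink the score of redundant
# filters so that the criterion selects them before rare ones.
aware_keep = structured_prune_mask(l1 * (1.0 - redundancy(weight)), sparsity=0.25)

print("plain l1 keep mask:        ", plain_keep)
print("redundancy-aware keep mask:", aware_keep)
```

In this toy layer the plain criterion removes filter 2, the rare filter with a small norm, while the rescaled score removes filter 1, the near-duplicate. Because the rescaling is not greedy, both members of a duplicate pair would be pruned at higher sparsity; a real method would need to re-evaluate redundancy after each removal.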

updated: Fri Mar 11 2022 09:00:59 GMT+0000 (UTC)

published: Fri Mar 11 2022 09:00:59 GMT+0000 (UTC)

arXiv
