Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity

Artem Vysogorets; Julia Kempe

接続性の問題：効果的なスパース性のレンズを介したニューラルネットワークのプルーニング

ニューラルネットワークの剪定は、スパース性の高い体制への関心が急上昇している実り多い研究分野です。このドメインでのベンチマークは、サブネットワークのスパース性の忠実な表現に大きく依存しています。これは、従来、削除された接続の割合（直接スパース性）として計算されていました。ただし、この定義では、基盤となるサブネットワークの入力層または出力層から切り離された、削除されていないパラメーターを認識できず、実際の有効なスパース性（非アクティブ化された接続の割合）を過小評価する可能性があります。この影響は、中程度にプルーニングされたネットワーク（最大10〜100の圧縮率）では無視できる可能性がありますが、より薄いサブネットワークでは役割が大きくなり、異なるプルーニングアルゴリズム間の比較が大きく歪むことがわかります。たとえば、ランダムに剪定されたLeNet-300-100の効果的な圧縮は、直接の対応物よりも桁違いに大きくなる可能性がありますが、剪定にSynFlowを使用した場合に差異は観察されません[Tanaka et al。、2020]。この作業では、効果的なスパース性のレンズを採用して、一般的なベンチマークアーキテクチャ（LeNet-300-100、VGG-19、ResNet-18など）で最近のいくつかのプルーニングアルゴリズムを再評価し、絶対的および相対的なパフォーマンスがこの中で劇的に変化することを発見します新しい、より適切なフレームワーク。直接的なスパース性ではなく効果的なスパース性を目指すために、ほとんどの剪定アルゴリズムに対して低コストの拡張機能を開発します。さらに、参照フレームとして有効なスパース性を備えており、レイヤー間で適切なスパース性を割り当てたランダムプルーニングが、初期化時にプルーニングするためのより高度なアルゴリズムと同等またはそれ以上に機能することを部分的に再確認します[Su et al。、2020]。この観察に応えて、物理学からの結合シリンダー内の圧力分布の単純なアナロジーを使用して、ランダム剪定のコンテキストで既存のすべてのベースラインを上回る新しいレイヤーごとのスパース性クォータを設計します。

Neural network pruning is a fruitful area of research with surging interest in high sparsity regimes. Benchmarking in this domain heavily relies on faithful representation of the sparsity of subnetworks, which has been traditionally computed as the fraction of removed connections (direct sparsity). This definition, however, fails to recognize unpruned parameters that detached from input or output layers of underlying subnetworks, potentially underestimating actual effective sparsity: the fraction of inactivated connections. While this effect might be negligible for moderately pruned networks (up to 10-100 compression rates), we find that it plays an increasing role for thinner subnetworks, greatly distorting comparison between different pruning algorithms. For example, we show that effective compression of a randomly pruned LeNet-300-100 can be orders of magnitude larger than its direct counterpart, while no discrepancy is ever observed when using SynFlow for pruning [Tanaka et al., 2020]. In this work, we adopt the lens of effective sparsity to reevaluate several recent pruning algorithms on common benchmark architectures (e.g., LeNet-300-100, VGG-19, ResNet-18) and discover that their absolute and relative performance changes dramatically in this new and more appropriate framework. To aim for effective, rather than direct, sparsity, we develop a low-cost extension to most pruning algorithms. Further, equipped with effective sparsity as a reference frame, we partially reconfirm that random pruning with appropriate sparsity allocation across layers performs as well or better than more sophisticated algorithms for pruning at initialization [Su et al., 2020]. In response to this observation, using a simple analogy of pressure distribution in coupled cylinders from physics, we design novel layerwise sparsity quotas that outperform all existing baselines in the context of random pruning.

updated: Sat Apr 08 2023 01:04:39 GMT+0000 (UTC)

published: Mon Jul 05 2021 22:36:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト