Trainability Preserving Neural Pruning

Huan Wang; Yun Fu

Many recent works have shown that trainability plays a central role in neural network pruning: left unattended, broken trainability can lead to severe under-performance and unintentionally amplify the effect of the retraining learning rate, resulting in biased (or even misinterpreted) benchmark results. This paper introduces trainability preserving pruning (TPP), a scalable method to preserve network trainability against pruning, aiming for improved pruning performance and greater robustness to retraining hyper-parameters (e.g., learning rate). Specifically, we propose to penalize the Gram matrix of convolutional filters to decorrelate the pruned filters from the retained filters. In addition to the convolutional layers, in the spirit of preserving the trainability of the whole network, we also propose to regularize the batch normalization parameters (scale and bias). Empirical studies on linear MLP networks show that TPP can perform on par with the oracle trainability recovery scheme. On nonlinear ConvNets (ResNet56/VGG19) on CIFAR10/100, TPP outperforms its counterpart approaches by a clear margin. Moreover, results on ImageNet-1K with ResNets suggest that TPP consistently compares favorably against other top-performing structured pruning approaches. Code: https://github.com/MingSun-Tse/TPP.
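
To make the Gram-matrix idea concrete, here is a minimal sketch of a decorrelation penalty, written in plain Python so that it is self-contained. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact form of the penalty, so this version simply sums the squared Gram entries that pair a pruned filter with a retained one. The function names `gram_matrix` and `cross_correlation_penalty` are hypothetical, and each filter is assumed to be already flattened into a vector.

```python
def gram_matrix(filters):
    """Return G with G[i][j] = <w_i, w_j>, one flattened filter per row."""
    return [[sum(a * b for a, b in zip(wi, wj)) for wj in filters]
            for wi in filters]


def cross_correlation_penalty(filters, pruned):
    """Sum of squared Gram entries between pruned and retained filters.

    Assumption: only cross terms (one index pruned, the other retained)
    are penalized; the abstract does not specify the exact form.
    """
    pruned = set(pruned)
    gram = gram_matrix(filters)
    n = len(filters)
    return sum(
        gram[i][j] ** 2
        for i in range(n)
        for j in range(n)
        if (i in pruned) != (j in pruned)  # exactly one of the pair is pruned
    )


# Filter 2 overlaps with both retained filters, so the penalty is nonzero.
correlated = [[1, 0], [0, 1], [1, 1]]
print(cross_correlation_penalty(correlated, pruned=[2]))

# Mutually orthogonal filters are already decorrelated: the penalty vanishes.
orthogonal = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
print(cross_correlation_penalty(orthogonal, pruned=[2]))
```

In a training setting, a term of this kind would be added to the loss so that gradient descent drives the pruned filters toward orthogonality with the retained ones before removal; how it is weighted and scheduled is left open here.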

updated: Fri Mar 03 2023 05:39:11 GMT+0000 (UTC)

published: Mon Jul 25 2022 21:15:47 GMT+0000 (UTC)

arXiv
