Pruning-aware Sparse Regularization for Network Pruning

Nanfei Jiang; Xu Zhao; Chaoyang Zhao; Yongqi An; Ming Tang; Jinqiao Wang

ネットワークプルーニングのためのプルーニング対応のスパース正則化

構造的ニューラルネットワークの剪定は、最終的な出力精度にとってそれほど重要ではないフィルターを剪定することにより、深い畳み込みニューラルネットワーク（CNN）の冗長チャネルを削除することを目的としています。剪定後のパフォーマンスの低下を減らすために、多くの方法では、スパース正則化による損失を利用して、構造化されたスパース性を生成します。この論文では、これらのスパース性トレーニングベースの方法を分析し、剪定されていないチャネルの正則化が不要であることを発見しました。さらに、ネットワークの容量が制限されるため、フィッティングが不十分になります。この問題を解決するために、剪定を意識したスパース正則化を使用した、MaskSparsityという名前の新しい剪定方法を提案します。 MaskSparsityは、モデルのすべてのフィルターではなく、プルーニングマスクによって選択された特定のフィルターにきめ細かいスパース正則化を課します。 MaskSparityのきめ細かいスパース正則化の前に、グローバルスパース正則化の実行など、多くのメソッドを使用してプルーニングマスクを取得できます。 MaskSparsityは、パラメーターの60.34％を削除することにより、ResNet-110で63.03％-FLOPsの削減を達成し、CIFAR-10でトップ1の精度を失うことはありません。 ILSVRC-2012では、MaskSparsityはResNet-50で51.07％以上のFLOPを削減し、上位1の精度で0.76％の損失しかありません。コードはhttps://github.com/CASIA-IVA-Lab/MaskSparsityでリリースされています。さらに、MaskSparityのコードをPyTorchプルーニングツールキットであるEasyPruner（https://gitee.com/casia_iva_engineer/easypruner）に統合しました。

Structural neural network pruning aims to remove the redundant channels in the deep convolutional neural networks (CNNs) by pruning the filters of less importance to the final output accuracy. To reduce the degradation of performance after pruning, many methods utilize the loss with sparse regularization to produce structured sparsity. In this paper, we analyze these sparsity-training-based methods and find that the regularization of unpruned channels is unnecessary. Moreover, it restricts the network's capacity, which leads to under-fitting. To solve this problem, we propose a novel pruning method, named MaskSparsity, with pruning-aware sparse regularization. MaskSparsity imposes the fine-grained sparse regularization on the specific filters selected by a pruning mask, rather than all the filters of the model. Before the fine-grained sparse regularization of MaskSparity, we can use many methods to get the pruning mask, such as running the global sparse regularization. MaskSparsity achieves 63.03%-FLOPs reduction on ResNet-110 by removing 60.34% of the parameters, with no top-1 accuracy loss on CIFAR-10. On ILSVRC-2012, MaskSparsity reduces more than 51.07% FLOPs on ResNet-50, with only a loss of 0.76% in the top-1 accuracy. The code is released at https://github.com/CASIA-IVA-Lab/MaskSparsity. Moreover, we have integrated the code of MaskSparity into a PyTorch pruning toolkit, EasyPruner, at https://gitee.com/casia_iva_engineer/easypruner.

updated: Tue Jan 18 2022 07:19:23 GMT+0000 (UTC)

published: Tue Jan 18 2022 07:19:23 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト