Basis Scaling and Double Pruning for Efficient Transfer Learning

Ken C. L. Wong; Satyananda Kashyap; Mehdi Moradi

Transfer learning allows the reuse of deep learning features on new datasets with limited data. However, the resulting models can be unnecessarily large and therefore inefficient. Although network pruning can be applied to improve inference efficiency, existing algorithms usually require fine-tuning and may not be suitable for small datasets. In this paper, we propose an algorithm that transforms the convolutional weights into subspaces spanned by orthonormal bases, in which the model is pruned. Using singular value decomposition, we decompose a convolutional layer into two layers: a convolutional layer with the orthonormal basis vectors as its filters, and a layer that we name "BasisScalingConv", which is responsible for rescaling the features and transforming them back to the original space. Because the filters in each transformed layer are linearly independent with known relative importance, pruning can be more effective and stable, and fine-tuning individual weights is unnecessary. Furthermore, because the numbers of input and output channels of the original convolutional layer remain unchanged, basis pruning is applicable to virtually all network architectures. Basis pruning can also be combined with existing pruning algorithms ("double pruning") to further increase the pruning capability. With less than 1% reduction in classification accuracy, we can achieve pruning ratios of up to 98.9% in parameters and 98.6% in FLOPs.
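
The decomposition described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the weight layout `(k_h, k_w, c_in, c_out)`, the function names `decompose_conv`, `prune_basis`, and `apply_to_patch`, and the choice to keep the `keep` largest singular values are all assumptions made for illustration. The sketch treats one convolution output position as a product between a flattened input patch and the reshaped weight matrix, so the first factor plays the role of the orthonormal basis filters and the second factor plays the role of a 1x1 "BasisScalingConv"-style layer.

```python
import numpy as np


def decompose_conv(weights):
    """Split conv weights (k_h, k_w, c_in, c_out) into two factors via SVD.

    Returns orthonormal basis filters, the singular values, and a scaling
    matrix that maps basis features back to the original output channels.
    (Layout and names are illustrative assumptions, not the paper's code.)
    """
    k_h, k_w, c_in, c_out = weights.shape
    w_mat = weights.reshape(k_h * k_w * c_in, c_out)
    # w_mat = u @ diag(s) @ vt, with orthonormal columns in u.
    u, s, vt = np.linalg.svd(w_mat, full_matrices=False)
    basis_filters = u.reshape(k_h, k_w, c_in, -1)
    scaling = s[:, None] * vt  # acts like the weights of a 1x1 convolution
    return basis_filters, s, scaling


def prune_basis(basis_filters, s, scaling, keep):
    """Keep the `keep` basis filters with the largest singular values.

    np.linalg.svd returns singular values in descending order, so slicing
    the leading entries is enough. The output channel count is unchanged.
    """
    return basis_filters[..., :keep], s[:keep], scaling[:keep, :]


def apply_to_patch(patch, basis_filters, scaling):
    """Evaluate both layers at one spatial position.

    patch has shape (k_h, k_w, c_in); the result has shape (c_out,).
    """
    features = np.tensordot(patch, basis_filters, axes=3)  # basis conv
    return features @ scaling  # rescale and map back to the original space
```

Without pruning, `apply_to_patch` reproduces the output of the original filters up to floating-point error. After `prune_basis`, the intermediate feature count shrinks while the output still has `c_out` channels, which is the property the abstract relies on when it says the layer's input and output channel counts are unchanged.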

updated: Fri Aug 06 2021 00:04:02 GMT+0000 (UTC)

published: Fri Aug 06 2021 00:04:02 GMT+0000 (UTC)

arXiv
