Revisiting Dynamic Convolution via Matrix Decomposition

Yunsheng Li; Yinpeng Chen; Xiyang Dai; Mengchen Liu; Dongdong Chen; Ye Yu; Lu Yuan; Zicheng Liu; Mei Chen; Nuno Vasconcelos

Recent research on dynamic convolution shows a substantial performance boost for efficient CNNs, owing to the adaptive aggregation of K static convolution kernels. The approach has two limitations: (a) it increases the number of convolutional weights K-fold, and (b) the joint optimization of dynamic attention and static convolution kernels is challenging. In this paper, we revisit dynamic convolution from the new perspective of matrix decomposition and reveal that the key issue is that it applies dynamic attention over channel groups after projecting into a higher-dimensional latent space. To address this issue, we propose dynamic channel fusion to replace dynamic attention over channel groups. Dynamic channel fusion not only enables significant dimension reduction of the latent space, but also mitigates the joint optimization difficulty. As a result, our method is easier to train and requires significantly fewer parameters without sacrificing accuracy. Source code is at https://github.com/liyunsheng13/dcd.
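To make the "adaptive aggregation of K static convolution kernels" concrete, the sketch below illustrates the baseline that the abstract revisits, not the proposed dynamic channel fusion. It assumes the common formulation in which the aggregated weight is an attention-weighted sum of the K kernels, with attention given by a softmax over per-kernel scores. The function names, the 1x1 (pointwise) kernel shape, and the hand-picked scores are all hypothetical and chosen for illustration; in a real network the scores would be computed from the input.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_kernels(kernels, scores):
    """Return the attention-weighted sum of K static kernels.

    kernels: list of K matrices, each C_out x C_in (1x1 convolution weights).
    scores:  list of K raw attention scores (assumed input-dependent in practice).
    """
    attention = softmax(scores)
    rows, cols = len(kernels[0]), len(kernels[0][0])
    return [
        [sum(a * w[i][j] for a, w in zip(attention, kernels)) for j in range(cols)]
        for i in range(rows)
    ]


def apply_pointwise(weight, x):
    """Apply a 1x1 convolution at a single spatial position (matrix-vector product)."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in weight]


def count_weights(kernels):
    """Total number of stored convolution weights across all K kernels."""
    return sum(len(w) * len(w[0]) for w in kernels)


# Two hypothetical static kernels (K = 2, C_out = C_in = 2).
kernels = [
    [[1.0, 0.0], [0.0, 1.0]],  # identity
    [[0.0, 1.0], [1.0, 0.0]],  # channel swap
]
scores = [0.0, 0.0]  # equal scores give equal attention (0.5 each)

weight = aggregate_kernels(kernels, scores)
y = apply_pointwise(weight, [2.0, 4.0])
stored = count_weights(kernels)           # K * C_out * C_in
static = count_weights(kernels[:1])       # a single static kernel
```

Here `stored` is K times `static`, which corresponds to limitation (a) in the abstract: the aggregation keeps all K kernels in memory even though only one aggregated weight is applied per input.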

updated: Mon Mar 15 2021 23:03:18 GMT+0000 (UTC)

published: Mon Mar 15 2021 23:03:18 GMT+0000 (UTC)

arXiv
