Inner-Imaging Networks: Put Lenses into Convolutional Structure

Yang Hu; Guihua Wen; Mingnan Luo; Dan Dai; Wenming Cao; Zhiwen Yu; Wendy Hall

内部イメージングネットワーク：レンズを畳み込み構造に配置する

コンピュータビジョンで大きな成功を収めているにもかかわらず、深い畳み込みネットワークは、深刻な計算コストと冗長性に悩まされています。以前の研究では、フィルターの多様性を強化することでこの問題に取り組んでいますが、畳み込みネットワークの内部構造の補完性と完全性については考慮していません。これらの問題に対処するために、チャネル間の関係が上記の要件を満たすことを可能にする、新しい内部イメージングアーキテクチャがこの論文で提案されています。具体的には、畳み込みカーネルを使用してチャネル信号ポイントをグループに編成し、グループ内関係とグループ間関係の両方を同時にモデル化します。畳み込みフィルターは、空間関係をモデル化し、グループ化された信号を整理するための強力なツールであるため、提案された方法は、レンズを畳み込み内部構造に配置するように、チャネル信号を疑似画像にマッピングします。その結果、チャネルの多様性が向上するだけでなく、補完性と完全性も明示的に強化できます。提案されたアーキテクチャは軽量で、実装が簡単です。畳み込みネットワークの効率とパフォーマンスを向上させるために、効率的な自己組織化戦略を提供します。 CIFAR、SVHN、ImageNetなどの複数のベンチマーク画像認識データセットで広範な実験が行われます。実験結果は、バックボーンとして最も人気のある畳み込みネットワークを使用した内部イメージングメカニズムの有効性を検証します。

Despite the tremendous success in computer vision, deep convolutional networks suffer from serious computation costs and redundancies. Although previous works address this issue by enhancing diversities of filters, they have not considered the complementarity and the completeness of the internal structure of the convolutional network. To deal with these problems, a novel Inner-Imaging architecture is proposed in this paper, which allows relationships between channels to meet the above requirement. Specifically, we organize the channel signal points in groups using convolutional kernels to model both the intra-group and inter-group relationships simultaneously. The convolutional filter is a powerful tool for modeling spatial relations and organizing grouped signals, so the proposed methods map the channel signals onto a pseudo-image, like putting a lens into convolution internal structure. Consequently, not only the diversity of channels is increased, but also the complementarity and completeness can be explicitly enhanced. The proposed architecture is lightweight and easy to be implemented. It provides an efficient self-organization strategy for convolutional networks so as to improve their efficiency and performance. Extensive experiments are conducted on multiple benchmark image recognition data sets including CIFAR, SVHN and ImageNet. Experimental results verify the effectiveness of the Inner-Imaging mechanism with the most popular convolutional networks as the backbones.

updated: Fri Aug 27 2021 21:19:16 GMT+0000 (UTC)

published: Mon Apr 22 2019 16:44:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト