Feature Embedding by Template Matching as a ResNet Block

Ada Gorgun; Yeti Z. Gurbuz; A. Aydin Alatan

Convolution blocks serve as local feature extractors and are key to the success of neural networks. To make local semantic feature embedding more explicit, we reformulate convolution blocks as feature selection according to the best-matching kernel. In this manner, we show that typical ResNet blocks indeed perform local feature embedding via template matching once batch normalization (BN) followed by a rectified linear unit (ReLU) is interpreted as an arg-max optimizer. Following this perspective, we tailor a residual block that explicitly forces semantically meaningful local feature embedding by using label information. Specifically, we assign a feature vector to each local region according to the classes that the corresponding region matches. We evaluate our method on three popular benchmark datasets with several architectures for image classification and consistently show that our approach substantially improves the performance of the baseline architectures.
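The idea of "feature selection according to the best-matching kernel" can be illustrated with a toy example. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`best_template`, `embed_patch`), the use of a plain dot product as the matching score, and the zero-vector fallback are all hypothetical choices made for illustration. It also checks one fact that holds independently of the paper: ReLU applied to a matching score equals the best score when the kernel competes against an all-zero "null" template.

```python
def dot(u, v):
    """Plain dot product, used here as the (assumed) matching score."""
    return sum(a * b for a, b in zip(u, v))


def relu(x):
    """Rectified linear unit: max(x, 0)."""
    return max(x, 0.0)


def best_template(patch, templates):
    """Return (index, score) of the template that correlates best with the patch."""
    scores = [dot(patch, t) for t in templates]
    idx = max(range(len(scores)), key=scores.__getitem__)
    return idx, scores[idx]


def embed_patch(patch, templates, features):
    """Hard template-matching embedding (illustrative assumption).

    Emits the feature vector paired with the best-matching template,
    or a zero vector when no template has a positive score.
    """
    idx, score = best_template(patch, templates)
    if score <= 0.0:
        return [0.0] * len(features[0])
    return list(features[idx])


# Hypothetical toy data: three 1-D "kernels" and one feature vector per kernel.
templates = [[1.0, 0.0, -1.0], [-1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

patch = [1.0, 0.0, -1.0]
matched_index, matched_score = best_template(patch, templates)
embedding = embed_patch(patch, templates, features)

# ReLU on a single kernel's score equals the arg-max score against a null template.
w = [-1.0, 0.0, 1.0]
null_template = [0.0, 0.0, 0.0]
relu_score = relu(dot(patch, w))
argmax_score = best_template(patch, [w, null_template])[1]
```

In this sketch the selection is a hard arg-max over raw scores; how batch normalization enters the interpretation, and how label information shapes the assigned feature vectors, is specified in the paper itself and is not reproduced here.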

updated: Tue Aug 15 2023 15:06:47 GMT+0000 (UTC)

published: Mon Oct 03 2022 14:58:17 GMT+0000 (UTC)

arXiv
