Towards Ignoring Backgrounds and Improving Generalization: a Costless DNN Visual Attention Mechanism

Pedro R. A. S. Bassi; Sergio S. J. Dertkigil; Andrea Cavalli

This work introduces an attention mechanism for image classifiers and a corresponding deep neural network (DNN) architecture, dubbed ISNet. During training, the ISNet uses segmentation targets to learn to find the image's region of interest and concentrate its attention on it. The proposal is based on a novel concept: background relevance minimization in Layer-wise Relevance Propagation (LRP) explanation heatmaps. It can be applied to virtually any classification neural network architecture at no extra computational cost at run-time. Because it can ignore the background, the resulting single DNN can replace the common pipeline of a segmenter followed by a classifier while being faster and lighter. After injecting synthetic bias into image backgrounds (in diverse applications), we compare the ISNet to multiple state-of-the-art neural networks and quantitatively demonstrate its superior capacity to minimize the influence of the bias on the classifier's decisions. COVID-19 and tuberculosis detection in chest X-rays commonly rely on mixed training databases, which naturally foster background bias and shortcut learning. By focusing on the lungs, the ISNet reduced shortcut learning, leading to significantly better generalization to external (out-of-distribution) test datasets. ISNet thus offers an accurate, fast, and light methodology for ignoring backgrounds and improving generalization.
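To make the idea of background relevance minimization concrete, here is a minimal sketch in plain Python. It is not the loss defined in the paper. It only assumes that an explanation heatmap and a binary segmentation mask are already available for one image, and it measures the share of absolute relevance that falls outside the region of interest. The names `background_relevance_fraction`, `combined_loss`, and `weight` are hypothetical. A real implementation would compute the heatmap differentiably with LRP inside a deep learning framework so that the penalty can be backpropagated.

```python
def background_relevance_fraction(heatmap, foreground_mask, eps=1e-12):
    """Share of total absolute relevance lying outside the region of interest.

    heatmap: 2D list of floats (e.g. a relevance map for one image).
    foreground_mask: 2D list of 0/1 values, where 1 marks the region of interest.
    Illustrative only; this is an assumed simplification, not the paper's loss.
    """
    background = 0.0
    total = 0.0
    for heat_row, mask_row in zip(heatmap, foreground_mask):
        for relevance, is_foreground in zip(heat_row, mask_row):
            magnitude = abs(relevance)
            total += magnitude
            if not is_foreground:
                background += magnitude
    # eps avoids division by zero for an all-zero heatmap
    return background / (total + eps)


def combined_loss(classification_loss, heatmap, foreground_mask, weight=1.0):
    """Classification loss plus a weighted background-relevance penalty (assumed form)."""
    penalty = background_relevance_fraction(heatmap, foreground_mask)
    return classification_loss + weight * penalty


# Toy 2x2 example: the top row is background, the bottom row is foreground.
heatmap = [[0.5, -0.5], [2.0, 1.0]]
mask = [[0, 0], [1, 1]]
fraction = background_relevance_fraction(heatmap, mask)
total_loss = combined_loss(0.3, heatmap, mask, weight=2.0)
```

In this toy case one quarter of the absolute relevance sits in the background, so the penalty is about 0.25. Driving such a term toward zero during training pushes the classifier's explanation, and hence its attention, onto the segmented region, and the mask is only needed at training time.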

updated: Fri Jun 23 2023 22:41:32 GMT+0000 (UTC)

published: Tue Feb 01 2022 05:58:01 GMT+0000 (UTC)

arXiv
