Interpretable Attention Guided Network for Fine-grained Visual Classification

Zhenhuan Huang; Xiaoyue Duan; Bo Zhao; Jinhu Lü; Baochang Zhang

Fine-grained visual classification (FGVC) is challenging yet more critical than traditional classification tasks. It requires distinguishing different subcategories that exhibit inherently subtle intra-class object variations. Previous works focus on enhancing feature representation ability using multiple granularities and discriminative regions, based on attention strategies or bounding boxes. However, these methods rely heavily on deep neural networks, which lack interpretability. We propose an Interpretable Attention Guided Network (IAGN) for fine-grained visual classification. The contributions of our method include: i) an attention-guided framework that guides the network to extract discriminative regions in an interpretable way; ii) a progressive training mechanism that distills knowledge stage by stage to fuse features of various granularities; iii) the first interpretable FGVC method with competitive performance on several standard FGVC benchmark datasets.

updated: Tue Mar 09 2021 02:15:22 GMT+0000 (UTC)

published: Mon Mar 08 2021 12:27:51 GMT+0000 (UTC)

arXiv
