Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision

Jiawei Zhan; Jun Liu; Wei Tang; Guannan Jiang; Xi Wang; Bin-Bin Gao; Tianliang Zhang; Wenlong Wu; Wei Zhang; Chengjie Wang; Yuan Xie

グローバルとローカルの出会い: カテゴリを意識した弱い教師による効果的なマルチラベル画像分類

ラベル依存性と領域ベースの方法に分類できるマルチラベル画像分類は、基になるオブジェクトのレイアウトが複雑であるため、困難な問題です。領域ベースの方法は、ラベル依存の方法よりもモデルの一般化に関する問題に遭遇する可能性が低くなりますが、無差別な情報を含む何百もの無意味またはノイズの多い提案を生成することが多く、ローカライズされた領域間のコンテキスト依存性はしばしば無視されるか過度に単純化されます。 .この論文では、効果的なノイジー提案抑制を実行し、ロバストな機能学習のためにグローバル機能とローカル機能の間で相互作用する統合フレームワークを構築します。具体的には、存在しないカテゴリに集中してローカル機能学習に決定論的な情報を提供し、ローカルブランチがより高品質の関心領域に集中するように制限する、カテゴリ対応の弱い監督を提案します。さらに、グローバルとローカルの特徴間の補完的な情報を探索するためのクロスグラニュラリティアテンションモジュールを開発します。これにより、グローバルとローカルの関係だけでなく、ローカルとローカルの関係も含む高次の特徴相関を構築できます。両方の利点により、ネットワーク全体のパフォーマンスが向上します。 2 つの大規模なデータセット (MS-COCO と VOC 2007) での広範な実験は、私たちのフレームワークが最先端の方法よりも優れたパフォーマンスを達成することを示しています。

Multi-label image classification, which can be categorized into label-dependency and region-based methods, is a challenging problem due to the complex underlying object layouts. Although region-based methods are less likely to encounter issues with model generalizability than label-dependency methods, they often generate hundreds of meaningless or noisy proposals with non-discriminative information, and the contextual dependency among the localized regions is often ignored or over-simplified. This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning. Specifically, we propose category-aware weak supervision to concentrate on non-existent categories so as to provide deterministic information for local feature learning, restricting the local branch to focus on more high-quality regions of interest. Moreover, we develop a cross-granularity attention module to explore the complementary information between global and local features, which can build the high-order feature correlation containing not only global-to-local, but also local-to-local relations. Both advantages guarantee a boost in the performance of the whole network. Extensive experiments on two large-scale datasets (MS-COCO and VOC 2007) demonstrate that our framework achieves superior performance over state-of-the-art methods.

updated: Wed Nov 23 2022 05:39:17 GMT+0000 (UTC)

published: Wed Nov 23 2022 05:39:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト