Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation

Bingfeng Zhang; Jimin Xiao; Jianbo Jiao; Yunchao Wei; Yao Zhao

Weakly supervised semantic segmentation is receiving great attention due to its low human annotation cost. In this paper, we aim to tackle bounding box supervised semantic segmentation, i.e., training accurate semantic segmentation models using bounding box annotations as supervision. To this end, we propose the Affinity Attention Graph Neural Network (A^2GNN). Following previous practice, we first generate pseudo semantic-aware seeds, which are then formed into semantic graphs based on our newly proposed affinity Convolutional Neural Network (CNN). The built graphs are then input to our A^2GNN, in which an affinity attention layer is designed to acquire short- and long-distance information from soft graph edges and accurately propagate semantic labels from the confident seeds to the unlabeled pixels. However, to guarantee the precision of the seeds, we adopt only a limited number of confident pixel seed labels for A^2GNN, which may lead to insufficient supervision for training. To alleviate this issue, we further introduce a new loss function and a consistency-checking mechanism that leverage the bounding box constraint, so that more reliable guidance can be included in the model optimization. Experiments show that our approach achieves new state-of-the-art performance on the PASCAL VOC 2012 dataset (val: 76.5%, test: 75.2%). More importantly, our approach can be readily applied to the bounding box supervised instance segmentation task or to other weakly supervised semantic segmentation tasks, with state-of-the-art or comparable performance among almost all weakly supervised tasks on the PASCAL VOC or COCO datasets. Our source code will be available at https://github.com/zbf1991/A2GNN.
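
The idea of propagating labels from a few confident seeds to unlabeled pixels along soft graph edges can be illustrated with a much simpler stand-in. The sketch below is plain iterative label propagation over a weighted graph, not the affinity attention layer of A^2GNN, which is learned and is not specified in the abstract. The function name `propagate_labels`, the toy affinity matrix, and the fixed step count are all assumptions made for illustration.

```python
from typing import Dict, List


def propagate_labels(
    affinity: List[List[float]],
    seeds: Dict[int, int],
    num_classes: int,
    steps: int = 10,
) -> List[int]:
    """Spread seed labels over a soft affinity graph (illustrative only).

    affinity[i][j] is a non-negative edge weight between nodes i and j.
    seeds maps a node index to its confident class label.
    """
    n = len(affinity)
    # One-hot scores for seeded nodes, all zeros for unlabeled nodes.
    scores = [[0.0] * num_classes for _ in range(n)]
    for node, cls in seeds.items():
        scores[node][cls] = 1.0

    for _ in range(steps):
        new_scores = []
        for i in range(n):
            total = sum(affinity[i])
            row = [0.0] * num_classes
            if total > 0:
                for j in range(n):
                    w = affinity[i][j] / total  # row-normalised edge weight
                    for c in range(num_classes):
                        row[c] += w * scores[j][c]
            new_scores.append(row)
        # Clamp the confident seeds so they keep their original labels.
        for node, cls in seeds.items():
            new_scores[node] = [1.0 if c == cls else 0.0 for c in range(num_classes)]
        scores = new_scores

    # Each node takes the class with the highest accumulated score.
    return [max(range(num_classes), key=lambda c: scores[i][c]) for i in range(n)]


# Toy graph (hypothetical): nodes 0-1-2 and 3-4 are strongly connected,
# with only a weak edge between nodes 2 and 3.
affinity = [
    [0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.1, 0.0],
    [0.0, 0.0, 0.1, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0],
]
seeds = {0: 0, 4: 1}  # node 0 is a class-0 seed, node 4 is a class-1 seed
print(propagate_labels(affinity, seeds, num_classes=2))  # [0, 0, 0, 1, 1]
```

In this toy case the weak edge between nodes 2 and 3 acts like a low-affinity boundary, so each seed's label stays within its own strongly connected group. A node with no edges never receives a score and falls back to class 0 here, which a real system would need to handle explicitly.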

updated: Tue Jun 08 2021 02:19:21 GMT+0000 (UTC)

published: Tue Jun 08 2021 02:19:21 GMT+0000 (UTC)

arXiv
