Learning Class-Agnostic Pseudo Mask Generation for Box-Supervised Semantic Segmentation

Chaohao Xie; Dongwei Ren; Lei Wang; Qinghua Hu; Liang Lin; Wangmeng Zuo

ボックス教師ありセマンティックセグメンテーションのための学習クラスにとらわれない疑似マスク生成

最近、いくつかの弱く監視された学習方法が、深いセマンティックセグメンテーションモデルをトレーニングするためにバウンディングボックス監視を利用することに専念しています。ほとんどの既存の方法は、通常、一般的な提案ジェネレーター（たとえば、高密度CRFおよびMCG）を利用して、セグメンテーションモデルをさらにトレーニングするための拡張セグメンテーションマスクを生成します。ただし、これらのプロポーザルジェネレータは汎用的であり、ボックス監視ありセグメンテーションセグメンテーション用に特別に設計されていないため、セグメンテーションのパフォーマンスを向上させる余地があります。この論文では、ボックス教師ありセマンティックセグメンテーションに合わせて調整された、より正確な学習ベースのクラスに依存しない疑似マスクジェネレータを探すことを目指しています。この目的のために、クラスラベルがボックス注釈付きデータセットのラベルと重複しない、ピクセルレベルの注釈付き補助データセットを使用します。補助データセットから疑似マスクジェネレータを学習するために、2レベルの最適化の定式化を示します。特に、下部のサブ問題は、ボックスで監視されたセマンティックセグメンテーションを学習するために使用され、上部のサブ問題は、最適なクラスに依存しない疑似マスクジェネレーターを学習するために使用されます。次に、学習した疑似セグメンテーションマスクジェネレータをボックス注釈付きデータセットに展開して、弱教師ありセグメンテーションセグメンテーションを改善できます。 PASCAL VOC 2012データセットでの実験は、学習した疑似マスクジェネレーターがセグメンテーションパフォーマンスの向上に効果的であることを示しており、私たちの方法は、ボックス監視モデルと完全監視モデルの間のパフォーマンスギャップをさらに埋めることができます。私たちのコードはhttps://github.com/Vious/LPG_BBox_Segmentationで公開されます。

Recently, several weakly supervised learning methods have been devoted to utilize bounding box supervision for training deep semantic segmentation models. Most existing methods usually leverage the generic proposal generators (e.g. , dense CRF and MCG) to produce enhanced segmentation masks for further training segmentation models. These proposal generators, however, are generic and not specifically designed for box-supervised semantic segmentation, thereby leaving some leeway for improving segmentation performance. In this paper, we aim at seeking for a more accurate learning-based class-agnostic pseudo mask generator tailored to box-supervised semantic segmentation. To this end, we resort to a pixel-level annotated auxiliary dataset where the class labels are non-overlapped with those of the box-annotated dataset. For learning pseudo mask generator from the auxiliary dataset, we present a bi-level optimization formulation. In particular, the lower subproblem is used to learn box-supervised semantic segmentation, while the upper subproblem is used to learn an optimal class-agnostic pseudo mask generator. The learned pseudo segmentation mask generator can then be deployed to the box-annotated dataset for improving weakly supervised semantic segmentation. Experiments on PASCAL VOC 2012 dataset show that the learned pseudo mask generator is effective in boosting segmentation performance, and our method can further close the performance gap between box-supervised and fully-supervised models. Our code will be made publicly available at https://github.com/Vious/LPG_BBox_Segmentation .

updated: Tue Mar 09 2021 14:54:54 GMT+0000 (UTC)

published: Tue Mar 09 2021 14:54:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト