SASFormer: Transformers for Sparsely Annotated Semantic Segmentation

Hui Su; Yue Ye; Wei Hua; Lechao Cheng; Mingli Song

SASFormer: まばらに注釈が付けられたセマンティックセグメンテーション用のトランスフォーマー

近年、スパースアノテーションに基づくセマンティックセグメンテーションが進んでいます。画像内の各オブジェクトの一部のみにラベルを付け、残りの部分にはラベルを付けません。既存のアプローチのほとんどは時間がかかり、多くの場合、多段階のトレーニング戦略が必要です。この作業では、SASFormer と呼ばれる segformer に基づく、シンプルかつ効果的なスパースアノテーション付きセマンティックセグメンテーションフレームワークを提案します。具体的には、フレームワークは最初に階層的なパッチアテンションマップを生成し、ネットワーク予測を乗算して、有効なラベルで区切られた相関領域を生成します。さらに、相関結果の特徴とネットワーク予測の間の一貫性を確保するために、アフィニティ損失も導入します。広範な実験により、提案されたアプローチが既存の方法よりも優れており、最先端のパフォーマンスを達成することが示されています。ソースコードは、https://github.com/su-hui-zz/SASFormer で入手できます。

Semantic segmentation based on sparse annotation has advanced in recent years. It labels only part of each object in the image, leaving the remainder unlabeled. Most of the existing approaches are time-consuming and often necessitate a multi-stage training strategy. In this work, we propose a simple yet effective sparse annotated semantic segmentation framework based on segformer, dubbed SASFormer, that achieves remarkable performance. Specifically, the framework first generates hierarchical patch attention maps, which are then multiplied by the network predictions to produce correlated regions separated by valid labels. Besides, we also introduce the affinity loss to ensure consistency between the features of correlation results and network predictions. Extensive experiments showcase that our proposed approach is superior to existing methods and achieves cutting-edge performance. The source code is available at https://github.com/su-hui-zz/SASFormer.

updated: Mon Dec 05 2022 04:33:12 GMT+0000 (UTC)

published: Mon Dec 05 2022 04:33:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト