Positive-Negative Equal Contrastive Loss for Semantic Segmentation

Jing Wang; Lingfei Xuan; Wenxuan Wang; Tianxiang Zhang; Jiangyun Li

セマンティックセグメンテーションの正負の等しい対照損失

コンテキスト情報は、さまざまなコンピュータービジョンタスクにとって重要です。以前の作業では、一般にプラグアンドプレイモジュールと構造損失を設計して、グローバルコンテキストを効果的に抽出および集約します。これらの方法では、ファインラベルを使用してモデルを最適化しますが、ファイントレーニングされた機能も貴重なトレーニングリソースであり、ハードピクセル（つまり、誤って分類されたピクセル）に好ましい分布を導入する可能性があることを無視します。教師なしパラダイムでの対照学習に触発されて、教師あり方法で対照損失を適用し、損失関数を再設計して、教師なし学習のステレオタイプ（たとえば、ポジティブとネガティブの不均衡、アンカーコンピューティングの混乱）を排除します。この目的のために、アンカーへの正の埋め込みの潜在的な影響を増大させ、正と負のサンプルペアを同等に扱う、正-負の等しい対照損失（PNE損失）を提案します。 PNEの損失は、既存のセマンティックセグメンテーションフレームワークに直接プラグインでき、無視できる余分な計算コストで優れたパフォーマンスを実現します。多数の従来のセグメンテーション手法（DeepLabV3、OCRNet、UperNetなど）とバックボーン（ResNet、HRNet、Swin Transformerなど）を利用して、包括的な実験を実施し、2つのベンチマークデータセット（例、都市の景観とCOCO-スタッフ）。私たちのコードはまもなく公開されます。

The contextual information is critical for various computer vision tasks, previous works commonly design plug-and-play modules and structural losses to effectively extract and aggregate the global context. These methods utilize fine-label to optimize the model but ignore that fine-trained features are also precious training resources, which can introduce preferable distribution to hard pixels (i.e., misclassified pixels). Inspired by contrastive learning in unsupervised paradigm, we apply the contrastive loss in a supervised manner and re-design the loss function to cast off the stereotype of unsupervised learning (e.g., imbalance of positives and negatives, confusion of anchors computing). To this end, we propose Positive-Negative Equal contrastive loss (PNE loss), which increases the latent impact of positive embedding on the anchor and treats the positive as well as negative sample pairs equally. The PNE loss can be directly plugged right into existing semantic segmentation frameworks and leads to excellent performance with neglectable extra computational costs. We utilize a number of classic segmentation methods (e.g., DeepLabV3, OCRNet, UperNet) and backbone (e.g., ResNet, HRNet, Swin Transformer) to conduct comprehensive experiments and achieve state-of-the-art performance on two benchmark datasets (e.g., Cityscapes and COCO-Stuff). Our code will be publicly available soon.

updated: Tue Jul 05 2022 03:12:56 GMT+0000 (UTC)

published: Mon Jul 04 2022 13:51:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト