High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation

Arvi Jonnarth; Yushan Zhang; Michael Felsberg

弱い教師ありセグメンテーションを強化するための忠実度の高い疑似ラベル

画像レベルの弱教師セマンティックセグメンテーション (WSSS) のタスクは、セグメンテーションモデルをトレーニングするための膨大なデータアノテーションコストを削減するため、近年人気を博しています。 WSSS の典型的なアプローチには、畳み込み特徴マップでグローバル平均プーリング (GAP) を使用して画像分類ネットワークをトレーニングすることが含まれます。これにより、画像領域の重要性を識別するクラス活性化マップ (CAM) に基づいてオブジェクトの位置を推定できます。次に、CAM を使用してセグメンテーションマスクの形式で疑似ラベルを生成し、ピクセルレベルのグラウンドトゥルースがない場合にセグメンテーションモデルを監視します。 SEAM ベースラインの場合、以前の研究では次の 2 つの方法で CAM 学習を改善することが提案されました。(1) GAP の代わりとなる重要度サンプリング、および (2) オブジェクトの輪郭をほぼ排他的に輪郭化するヒューリスティックを利用する特徴類似性損失。画像のカラーエッジに合わせます。この作業では、これらの手法に対して CAM の異なる確率論的解釈を提案し、可能性を多項事後分布よりも適切にします。その結果、本質的に以前の WSSS メソッドを後押しできるアドオンメソッドを提案し、実装されているすべての最先端のベースラインの領域の類似性と輪郭の品質の両方を改善します。これは、PASCAL VOC データセットのさまざまなベースラインで実証されています。 MS COCO データセットの実験では、大規模な設定でもパフォーマンスが向上することが示されています。コードは https://github.com/arvijj/hfpl で入手できます。

The task of image-level weakly-supervised semantic segmentation (WSSS) has gained popularity in recent years, as it reduces the vast data annotation cost for training segmentation models. The typical approach for WSSS involves training an image classification network using global average pooling (GAP) on convolutional feature maps. This enables the estimation of object locations based on class activation maps (CAMs), which identify the importance of image regions. The CAMs are then used to generate pseudo-labels, in the form of segmentation masks, to supervise a segmentation model in the absence of pixel-level ground truth. In case of the SEAM baseline, a previous work proposed to improve CAM learning in two ways: (1) Importance sampling, which is a substitute for GAP, and (2) the feature similarity loss, which utilizes a heuristic that object contours almost exclusively align with color edges in images. In this work, we propose a different probabilistic interpretation of CAMs for these techniques, rendering the likelihood more appropriate than the multinomial posterior. As a result, we propose an add-on method that can boost essentially any previous WSSS method, improving both the region similarity and contour quality of all implemented state-of-the-art baselines. This is demonstrated on a wide variety of baselines on the PASCAL VOC dataset. Experiments on the MS COCO dataset show that performance gains can also be achieved in a large-scale setting. Our code is available at https://github.com/arvijj/hfpl.

updated: Wed Apr 05 2023 17:43:57 GMT+0000 (UTC)

published: Wed Apr 05 2023 17:43:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト