Two-stage Decision Improves Open-Set Panoptic Segmentation

Hai-Ming Xu; Hao Chen; Lingqiao Liu; Yufei Yin

2段階の決定により、開集合のパノプティコンセグメンテーションが改善されます

開集合パノプティコンセグメンテーション（OPS）問題は、\knownクラスと\unknownクラスの両方、つまりトレーニングセットで注釈が付けられないオブジェクト（「モノ」）のセグメンテーションを実行することを目的とした新しい研究の方向性です。 OPSの主な課題は2つあります。（1）未知のオブジェクトの出現の可能性が無限であるため、限られた数のトレーニングデータからそれらをモデル化することは困難です。（2）トレーニング時には、「不明なもの」と「背景」のクラスが本質的に混在する「void」カテゴリのみが提供されます。経験的に、スクリーニングなしで「void」カテゴリを直接使用して「既知のクラス」または「背景」を監視しても、満足のいくOPS結果が得られないことがわかりました。この論文では、分割統治法を提案して、OPSの2段階の意思決定プロセスを開発します。既知のクラス弁別子を追加のクラスに依存しないオブジェクト予測ヘッドと適切に組み合わせることにより、OPSパフォーマンスを大幅に改善できることを示します。具体的には、最初に、既知のカテゴリのみを使用して分類器を作成し、「void」クラスの提案でそれらのカテゴリからの予測確率を低くすることを提案します。次に、追加のオブジェクト予測ヘッドを使用して、「未知のもの」と背景を区別します。パフォーマンスをさらに向上させるために、最新のモデルから生成された「未知のもの」の疑似ラベルと、トレーニングセットを強化するためのヒューリスティックルールを導入します。私たちの広範な実験的評価は、私たちのアプローチが、既存の最良の方法よりも30％以上の相対的な改善で、未知のクラスのパノラマ品質を大幅に改善することを示しています。

Open-set panoptic segmentation (OPS) problem is a new research direction aiming to perform segmentation for both \known classes and \unknown classes, i.e., the objects ("things") that are never annotated in the training set. The main challenges of OPS are twofold: (1) the infinite possibility of the \unknown object appearances makes it difficult to model them from a limited number of training data. (2) at training time, we are only provided with the "void" category, which essentially mixes the "unknown thing" and "background" classes. We empirically find that directly using "void" category to supervise \known class or "background" without screening will not lead to a satisfied OPS result. In this paper, we propose a divide-and-conquer scheme to develop a two-stage decision process for OPS. We show that by properly combining a \known class discriminator with an additional class-agnostic object prediction head, the OPS performance can be significantly improved. Specifically, we first propose to create a classifier with only \known categories and let the "void" class proposals achieve low prediction probability from those categories. Then we distinguish the "unknown things" from the background by using the additional object prediction head. To further boost performance, we introduce "unknown things" pseudo-labels generated from up-to-date models and a heuristic rule to enrich the training set. Our extensive experimental evaluation shows that our approach significantly improves \unknown class panoptic quality, with more than 30% relative improvements than the existing best-performed method.

updated: Sat Jul 09 2022 03:29:49 GMT+0000 (UTC)

published: Wed Jul 06 2022 08:12:58 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト