Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation

Yuanwei Liu; Nian Liu; Xiwen Yao; Junwei Han

少数ショットセマンティックセグメンテーション用の中間プロトタイプマイニングトランスフォーマー

少数ショットセマンティックセグメンテーションは、いくつかの注釈付きサポートイメージの条件下で、クエリ内のターゲットオブジェクトをセグメント化することを目的としています。以前のほとんどの作業では、サポートからより効果的なカテゴリ情報をマイニングして、クエリ内の対応するオブジェクトと照合しようとしています。ただし、それらはすべて、クエリイメージとサポートイメージの間のカテゴリ情報のギャップを無視しています。それらのオブジェクトが大きなクラス内多様性を示す場合、カテゴリ情報をサポートからクエリに強制的に移行しても効果がありません。この問題を解決するために、サポートからの決定論的カテゴリ情報とクエリからの適応カテゴリ知識の両方をマイニングするための中間プロトタイプを初めて導入しました。具体的には、反復的な方法でプロトタイプを学習するための中間プロトタイプマイニングトランスフォーマー (IPMT) を設計します。各 IPMT レイヤーでは、サポート機能とクエリ機能の両方のオブジェクト情報をプロトタイプに伝達し、それを使用してクエリ機能マップをアクティブにします。このプロセスを繰り返し実行することで、中間プロトタイプとクエリ機能の両方を徐々に改善できます。最後に、最終的なクエリ機能を使用して、正確なセグメンテーション予測を生成します。 PASCAL-5i と COCO-20i の両方のデータセットでの広範な実験により、IPMT の有効性が明確に検証され、以前の最先端の方法よりも大幅に優れていることが示されています。コードは https://github.com/LIUYUANWEI98/IPMT で入手できます

Few-shot semantic segmentation aims to segment the target objects in query under the condition of a few annotated support images. Most previous works strive to mine more effective category information from the support to match with the corresponding objects in query. However, they all ignored the category information gap between query and support images. If the objects in them show large intra-class diversity, forcibly migrating the category information from the support to the query is ineffective. To solve this problem, we are the first to introduce an intermediate prototype for mining both deterministic category information from the support and adaptive category knowledge from the query. Specifically, we design an Intermediate Prototype Mining Transformer (IPMT) to learn the prototype in an iterative way. In each IPMT layer, we propagate the object information in both support and query features to the prototype and then use it to activate the query feature map. By conducting this process iteratively, both the intermediate prototype and the query feature can be progressively improved. At last, the final query feature is used to yield precise segmentation prediction. Extensive experiments on both PASCAL-5i and COCO-20i datasets clearly verify the effectiveness of our IPMT and show that it outperforms previous state-of-the-art methods by a large margin. Code is available at https://github.com/LIUYUANWEI98/IPMT

updated: Thu Oct 13 2022 06:45:07 GMT+0000 (UTC)

published: Thu Oct 13 2022 06:45:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト