Fully Convolutional Networks for Panoptic Segmentation with Point-based Supervision

Yanwei Li; Hengshuang Zhao; Xiaojuan Qi; Yukang Chen; Lu Qi; Liwei Wang; Zeming Li; Jian Sun; Jiaya Jia

ポイントベースの監視によるパノプティックセグメンテーションのための完全畳み込みネットワーク

この論文では、パノプティック FCN と呼ばれる、完全および弱教師付きパノプティックセグメンテーションのための概念的にシンプルで強力かつ効率的なフレームワークを提示します。私たちのアプローチは、ポイントベースの完全または弱い監視で最適化できる、統合された完全な畳み込みパイプラインでフォアグラウンドのものとバックグラウンドのものを表現および予測することを目的としています。特に、Panoptic FCN は、提案されたカーネルジェネレーターを使用して各オブジェクトインスタンスまたはスタッフカテゴリをエンコードし、高解像度の特徴を直接畳み込むことによって予測を生成します。このアプローチを使用すると、モノとモノのインスタンス対応で意味的に一貫したプロパティを、単純な「カーネルを生成してからセグメント化する」ワークフローでそれぞれ満たすことができます。ローカリゼーションまたはインスタンス分離のための追加のボックスがないため、提案されたアプローチは、以前のボックスベースのモデルやボックスフリーのモデルよりも効率が優れています。さらに、弱い教師付きパノプティックセグメンテーションのための新しい形式のポイントベースの注釈を提案します。モノとモノの両方にいくつかのランダムなポイントが必要なだけで、人間の注釈コストが劇的に削減されます。提案されたパノプティック FCN は、この弱い教師ありの設定ではるかに優れたパフォーマンスを発揮することも証明されており、インスタンスごとにランダムに注釈が付けられた 20 点のみで、完全教師ありのパフォーマンスの 82% を達成します。 COCO、VOC 2012、Cityscapes、および Mapillary Vistas のデータセットに対する Panoptic FCN の有効性と効率性は、広範な実験によって実証されています。そして、完全および弱教師付きパノプティックセグメンテーションの両方の新しい主要なベンチマークを設定します。私たちのコードとモデルは、https://github.com/dvlab-research/PanopticFCN で公開されています。

In this paper, we present a conceptually simple, strong, and efficient framework for fully- and weakly-supervised panoptic segmentation, called Panoptic FCN. Our approach aims to represent and predict foreground things and background stuff in a unified fully convolutional pipeline, which can be optimized with point-based fully or weak supervision. In particular, Panoptic FCN encodes each object instance or stuff category with the proposed kernel generator and produces the prediction by convolving the high-resolution feature directly. With this approach, instance-aware and semantically consistent properties for things and stuff can be respectively satisfied in a simple generate-kernel-then-segment workflow. Without extra boxes for localization or instance separation, the proposed approach outperforms the previous box-based and -free models with high efficiency. Furthermore, we propose a new form of point-based annotation for weakly-supervised panoptic segmentation. It only needs several random points for both things and stuff, which dramatically reduces the annotation cost of human. The proposed Panoptic FCN is also proved to have much superior performance in this weakly-supervised setting, which achieves 82% of the fully-supervised performance with only 20 randomly annotated points per instance. Extensive experiments demonstrate the effectiveness and efficiency of Panoptic FCN on COCO, VOC 2012, Cityscapes, and Mapillary Vistas datasets. And it sets up a new leading benchmark for both fully- and weakly-supervised panoptic segmentation. Our code and models are made publicly available at https://github.com/dvlab-research/PanopticFCN.

updated: Thu Oct 13 2022 03:42:27 GMT+0000 (UTC)

published: Tue Aug 17 2021 15:28:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト