Fully Convolutional Networks for Panoptic Segmentation with Point-based Supervision

Yanwei Li; Hengshuang Zhao; Xiaojuan Qi; Yukang Chen; Lu Qi; Liwei Wang; Zeming Li; Jian Sun; Jiaya Jia

In this paper, we present a conceptually simple, strong, and efficient framework for fully- and weakly-supervised panoptic segmentation, called Panoptic FCN. Our approach aims to represent and predict foreground things and background stuff in a unified fully convolutional pipeline, which can be optimized with point-based full or weak supervision. In particular, Panoptic FCN encodes each object instance or stuff category with the proposed kernel generator and produces the prediction by convolving the high-resolution feature directly. With this approach, the instance-aware property of things and the semantically consistent property of stuff can each be satisfied in a simple generate-kernel-then-segment workflow. Without extra boxes for localization or instance separation, the proposed approach outperforms previous box-based and box-free models with high efficiency. Furthermore, we propose a new form of point-based annotation for weakly-supervised panoptic segmentation. It needs only several random points for both things and stuff, which dramatically reduces human annotation cost. Panoptic FCN is also shown to perform much better in this weakly-supervised setting, achieving 82% of the fully-supervised performance with only 20 randomly annotated points per instance. Extensive experiments demonstrate the effectiveness and efficiency of Panoptic FCN on the COCO, VOC 2012, Cityscapes, and Mapillary Vistas datasets, and it sets a new leading benchmark for both fully- and weakly-supervised panoptic segmentation. Our code and models are publicly available at https://github.com/dvlab-research/PanopticFCN
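
The generate-kernel-then-segment idea and the point-based supervision described above can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). It assumes that each generated kernel acts as a 1x1 convolution over the high-resolution feature, and that the point-based loss is a plain binary cross-entropy evaluated only at annotated points; the abstract does not specify either detail. The names `segment_with_kernels` and `point_bce_loss` are hypothetical, and the toy values are made up for illustration.

```python
import math


def segment_with_kernels(features, kernels):
    """Apply each generated kernel to the feature map as a 1x1 convolution.

    features: C x H x W nested lists (stand-in for the high-resolution feature).
    kernels:  N x C nested lists, one kernel per thing instance or stuff category.
    Returns N x H x W mask logits, one mask per kernel.
    """
    channels = len(features)
    height, width = len(features[0]), len(features[0][0])
    masks = []
    for kernel in kernels:
        assert len(kernel) == channels, "kernel length must match feature channels"
        # A 1x1 convolution is a per-pixel dot product over the channel axis.
        mask = [
            [sum(kernel[c] * features[c][y][x] for c in range(channels))
             for x in range(width)]
            for y in range(height)
        ]
        masks.append(mask)
    return masks


def point_bce_loss(mask_logits, points):
    """Binary cross-entropy averaged over annotated points only (assumed loss).

    points: list of (y, x, label), label 1 if the point lies on the target, else 0.
    Pixels without an annotation contribute nothing to the loss.
    """
    total = 0.0
    for y, x, label in points:
        prob = 1.0 / (1.0 + math.exp(-mask_logits[y][x]))
        total += -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))
    return total / len(points)


# Toy example: a 2-channel, 2x2 feature map and two generated kernels.
features = [
    [[1.0, 0.0], [0.0, 1.0]],  # channel 0
    [[0.0, 2.0], [2.0, 0.0]],  # channel 1
]
kernels = [
    [1.0, -1.0],   # kernel for a hypothetical thing instance
    [-1.0, 1.0],   # kernel for a hypothetical stuff category
]
masks = segment_with_kernels(features, kernels)

# Two annotated points for the first mask: one positive, one negative.
loss = point_bce_loss(masks[0], [(0, 0, 1), (0, 1, 0)])
```

In this sketch no boxes are involved: the number of output masks equals the number of generated kernels, and supervision touches only the pixels that carry a point annotation.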

updated: Tue Aug 17 2021 15:28:53 GMT+0000 (UTC)

published: Tue Aug 17 2021 15:28:53 GMT+0000 (UTC)

arXiv
