FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation

Xiang Li; Tianhan Wei; Yau Pun Chen; Yu-Wing Tai; Chi-Keung Tang

FSS-1000：少数ショットセグメンテーション用の1000クラスのデータセット

過去数年にわたって、PASCAL VOC、ImageNet、COCOなどの人間が注釈を付けた大規模なデータセットを利用できるため、画像認識におけるディープラーニングの成功を目の当たりにしてきました。これらのデータセットは広範囲のオブジェクトカテゴリをカバーしていますが、まだ含まれていないかなりの数のオブジェクトがあります。大量の人間による注釈なしで同じタスクを実行できますか？このホワイトペーパーでは、注釈付きのトレーニング例の数が5つに制限されている、少数ショットのオブジェクトセグメンテーションに関心があります。私たちのアプローチのパフォーマンスを評価および検証するために、グラウンドトゥルーセグメンテーションのピクセルごとのアノテーションを持つ1000個のオブジェクトクラスで構成される、数ショットのセグメンテーションデータセットFSS-1000を構築しました。 FSS-1000独自のデータセットには、以前のデータセットでは見られなかった、または注釈が付けられていないかなりの数のオブジェクトが含まれています。たとえば、小さな日常のオブジェクト、商品、漫画のキャラクター、ロゴなどです。 VGG-16、ResNet-101、およびInception。驚いたことに、FSS-1000を使用してモデルを最初からトレーニングすると、FSS-1000の100倍を超えるImageNetによって事前トレーニングされた重みを使用したトレーニングよりも同等で優れた結果が得られることがわかりました。私たちのアプローチとデータセットはどちらもシンプルで効果的で、注釈付きのトレーニング例がほとんどないため、新しいオブジェクトクラスのセグメンテーションを簡単に拡張できます。データセットはhttps://github.com/HKUSTCV/FSS-1000で入手できます。

Over the past few years, we have witnessed the success of deep learning in image recognition thanks to the availability of large-scale human-annotated datasets such as PASCAL VOC, ImageNet, and COCO. Although these datasets have covered a wide range of object categories, there are still a significant number of objects that are not included. Can we perform the same task without a lot of human annotations? In this paper, we are interested in few-shot object segmentation where the number of annotated training examples are limited to 5 only. To evaluate and validate the performance of our approach, we have built a few-shot segmentation dataset, FSS-1000, which consists of 1000 object classes with pixelwise annotation of ground-truth segmentation. Unique in FSS-1000, our dataset contains significant number of objects that have never been seen or annotated in previous datasets, such as tiny daily objects, merchandise, cartoon characters, logos, etc. We build our baseline model using standard backbone networks such as VGG-16, ResNet-101, and Inception. To our surprise, we found that training our model from scratch using FSS-1000 achieves comparable and even better results than training with weights pre-trained by ImageNet which is more than 100 times larger than FSS-1000. Both our approach and dataset are simple, effective, and easily extensible to learn segmentation of new object classes given very few annotated training examples. Dataset is available at https://github.com/HKUSTCV/FSS-1000.

updated: Wed Apr 29 2020 22:35:58 GMT+0000 (UTC)

published: Mon Jul 29 2019 11:47:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト