Self-EMD: Self-Supervised Object Detection without ImageNet

Songtao Liu; Zeming Li; Jian Sun

自己EMD：ImageNetを使用しない自己監視オブジェクト検出

本論文では、物体検出のための新しい自己教師あり表現学習法、Self-EMDを提案する。私たちの方法は、ImageNetのような一般的に使用される象徴的なオブジェクトの画像データセットの代わりに、COCOのようなラベルのない非象徴的な画像データセットで直接トレーニングされました。畳み込み特徴マップを画像の埋め込みとして保持して空間構造を保持し、Earth Mover's Distance（EMD）を採用して2つの埋め込み間の類似性を計算します。 Faster R-CNN（ResNet50-FPN）ベースラインは、COCOで39.8％のmAPを達成します。これは、ImageNetで事前トレーニングされた最先端の自己監視方式と同等です。さらに重要なことに、ラベルなしの画像を増やすことで40.4％mAPにさらに改善でき、より簡単に取得できるラベルなしのデータを活用できる大きな可能性を示しています。コードが利用可能になります。

In this paper, we propose a novel self-supervised representation learning method, Self-EMD, for object detection. Our method directly trained on unlabeled non-iconic image dataset like COCO, instead of commonly used iconic-object image dataset like ImageNet. We keep the convolutional feature maps as the image embedding to preserve spatial structures and adopt Earth Mover's Distance (EMD) to compute the similarity between two embeddings. Our Faster R-CNN (ResNet50-FPN) baseline achieves 39.8% mAP on COCO, which is on par with the state of the art self-supervised methods pre-trained on ImageNet. More importantly, it can be further improved to 40.4% mAP with more unlabeled images, showing its great potential for leveraging more easily obtained unlabeled data. Code will be made available.

updated: Mon Mar 22 2021 09:41:15 GMT+0000 (UTC)

published: Fri Nov 27 2020 11:27:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト