Spatially Consistent Representation Learning

Byungseok Roh; Wuhyun Shin; Ildoo Kim; Sungwoong Kim

空間的に一貫した表現学習

自己教師あり学習は、ラベルのない画像から転送可能な表現を取得するために広く使用されています。特に、最近の対照的な学習方法は、下流の画像分類タスクで印象的なパフォーマンスを示しています。これらの対照的な方法は、主にセマンティック保存変換の下で画像レベルで不変のグローバル表現を生成することに焦点を当てていますが、ローカル表現の空間的一貫性を見落とす傾向があるため、オブジェクト検出やインスタンスセグメンテーションなどのローカリゼーションタスクの事前トレーニングに制限があります。さらに、既存の対照的な方法で使用される積極的にトリミングされたビューは、単一の画像の意味的に異なる領域間の表現距離を最小限に抑えることができます。この論文では、マルチオブジェクトおよび場所固有のタスクのための空間的に一貫した表現学習アルゴリズム（SCRL）を提案します。特に、幾何学的な平行移動とズーム操作に従って、ランダムにトリミングされた局所領域のコヒーレントな空間表現を生成しようとする、新しい自己監視目的を考案します。ベンチマークデータセットを使用したさまざまなダウンストリームローカリゼーションタスクで、提案されたSCRLは、画像レベルの教師あり事前トレーニングおよび最先端の教師あり学習方法に比べて大幅なパフォーマンスの向上を示しています。

Self-supervised learning has been widely used to obtain transferrable representations from unlabeled images. Especially, recent contrastive learning methods have shown impressive performances on downstream image classification tasks. While these contrastive methods mainly focus on generating invariant global representations at the image-level under semantic-preserving transformations, they are prone to overlook spatial consistency of local representations and therefore have a limitation in pretraining for localization tasks such as object detection and instance segmentation. Moreover, aggressively cropped views used in existing contrastive methods can minimize representation distances between the semantically different regions of a single image. In this paper, we propose a spatially consistent representation learning algorithm (SCRL) for multi-object and location-specific tasks. In particular, we devise a novel self-supervised objective that tries to produce coherent spatial representations of a randomly cropped local region according to geometric translations and zooming operations. On various downstream localization tasks with benchmark datasets, the proposed SCRL shows significant performance improvements over the image-level supervised pretraining as well as the state-of-the-art self-supervised learning methods.

updated: Wed Mar 10 2021 15:23:45 GMT+0000 (UTC)

published: Wed Mar 10 2021 15:23:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト