The Hidden Uniform Cluster Prior in Self-Supervised Learning

Mahmoud Assran; Randall Balestriero; Quentin Duval; Florian Bordes; Ishan Misra; Piotr Bojanowski; Pascal Vincent; Michael Rabbat; Nicolas Ballas

自己教師あり学習における隠れ均一クラスタプライア

表現学習で成功するパラダイムは、ミニバッチ統計 (SimCLR、VICReg、SwAV、MSN など) に基づくタスクを使用して自己教師あり事前トレーニングを実行することです。これらすべての方法の定式化において、データの均一なクラスタリングを可能にする機能を学習する前に見落とされていることを示します。 ImageNet などのクラスバランスのとれたデータで事前トレーニングを行う場合、この事前設定により著しくセマンティックな表現が得られますが、クラスの不均衡なデータで事前トレーニングを行うとパフォーマンスが低下する可能性があることを示します。従来の均一性事前確率から離れ、代わりにべき乗分散特徴クラスターを優先することで、現実世界のクラス不均衡データセットで学習された表現の品質を向上できることを示します。これを実証するために、Masked Siamese Networks (MSN) メソッドの拡張機能を開発して、任意の特徴事前分布の使用をサポートします。

A successful paradigm in representation learning is to perform self-supervised pretraining using tasks based on mini-batch statistics (e.g., SimCLR, VICReg, SwAV, MSN). We show that in the formulation of all these methods is an overlooked prior to learn features that enable uniform clustering of the data. While this prior has led to remarkably semantic representations when pretraining on class-balanced data, such as ImageNet, we demonstrate that it can hamper performance when pretraining on class-imbalanced data. By moving away from conventional uniformity priors and instead preferring power-law distributed feature clusters, we show that one can improve the quality of the learned representations on real-world class-imbalanced datasets. To demonstrate this, we develop an extension of the Masked Siamese Networks (MSN) method to support the use of arbitrary features priors.

updated: Thu Oct 13 2022 18:10:01 GMT+0000 (UTC)

published: Thu Oct 13 2022 18:10:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト