Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere

Tongzhou Wang; Phillip Isola

Contrastive representation learning has been outstandingly successful in practice. In this work, we identify two key properties related to the contrastive loss: (1) alignment (closeness) of features from positive pairs, and (2) uniformity of the induced distribution of the (normalized) features on the hypersphere. We prove that, asymptotically, the contrastive loss optimizes these properties, and analyze their positive effects on downstream tasks. Empirically, we introduce an optimizable metric to quantify each property. Extensive experiments on standard vision and language datasets confirm the strong agreement between both metrics and downstream task performance. Remarkably, directly optimizing for these two metrics leads to representations with comparable or better performance at downstream tasks than contrastive learning.

Project Page: https://ssnl.github.io/hypersphere
Code: https://github.com/SsnL/align_uniform, https://github.com/SsnL/moco_align_uniform
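
The abstract names the two metrics but does not give their formulas. As a rough illustration only, the sketch below assumes the forms commonly associated with this work: alignment as the mean of ||x - y||^alpha over positive pairs, and uniformity as the logarithm of the mean Gaussian potential exp(-t ||x_i - x_j||^2) over distinct pairs of features. The function names, the NumPy implementation, the defaults alpha = 2 and t = 2, and the synthetic data are all assumptions made for this example, not the authors' released code.

```python
import numpy as np


def l2_normalize(z, eps=1e-12):
    """Project each row of z onto the unit hypersphere."""
    norms = np.linalg.norm(z, axis=1, keepdims=True)
    return z / np.maximum(norms, eps)


def align_loss(x, y, alpha=2.0):
    """Assumed alignment metric: mean of ||x_i - y_i||^alpha over positive pairs.

    Row i of x and row i of y are treated as one positive pair.
    """
    return float(np.mean(np.linalg.norm(x - y, axis=1) ** alpha))


def uniform_loss(x, t=2.0):
    """Assumed uniformity metric: log of the mean Gaussian potential
    exp(-t * ||x_i - x_j||^2) over all distinct pairs i < j."""
    i, j = np.triu_indices(x.shape[0], k=1)
    sq_dists = np.sum((x[i] - x[j]) ** 2, axis=1)
    return float(np.log(np.mean(np.exp(-t * sq_dists))))


# Synthetic stand-ins for encoder outputs (purely illustrative data).
rng = np.random.default_rng(0)
feats = l2_normalize(rng.normal(size=(256, 16)))            # spread-out features
noisy = l2_normalize(feats + 0.05 * rng.normal(size=feats.shape))  # "positive" views
collapsed = np.tile(feats[:1], (256, 1))                     # every feature identical

print("alignment (noisy positives):", align_loss(feats, noisy))
print("uniformity (spread out):    ", uniform_loss(feats))
print("uniformity (collapsed):     ", uniform_loss(collapsed))
```

Under these assumed definitions, lower is better for both quantities: identical positive pairs give an alignment value of zero, and a fully collapsed representation gives the worst possible uniformity value of zero, while well-spread features give a negative value.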

updated: Tue Nov 10 2020 07:05:17 GMT+0000 (UTC)

published: Wed May 20 2020 17:59:57 GMT+0000 (UTC)

arXiv
