Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning

Julien Denize; Jaonary Rabarisoa; Astrid Orcesi; Romain Hérault; Stéphane Canu

自己監視型ソフト対照学習のための類似性対照推定

対照表現学習は、効果的な自己教師あり学習方法であることが証明されています。最も成功しているアプローチは、ノイズコントラスト推定（NCE）パラダイムに基づいており、インスタンスのさまざまなビューをポジティブと見なし、他のインスタンスをポジティブと対比する必要のあるノイズと見なします。ただし、データセット内のすべてのインスタンスは同じ分布から抽出され、ノイズと見なされるべきではない基本的なセマンティック情報を共有します。優れたデータ表現には、インスタンス間の関係または意味的類似性が含まれていると主張します。対照学習は暗黙的に関係を学習しますが、ネガティブは学習された関係の質、したがって表現の質に有害なノイズと見なします。この問題を回避するために、類似性対照推定（SCE）と呼ばれるインスタンス間の意味的類似性を使用した対照学習の新しい定式化を提案します。私たちのトレーニング目標は、ソフトな対照学習と見なすことができます。ポジティブとネガティブを厳密に分類する代わりに、セマンティックの類似性に基づいてインスタンスをプッシュまたはプルする連続分布を提案します。ターゲットの類似性分布は、弱い拡張インスタンスから計算され、無関係な関係を排除するためにシャープ化されます。弱い拡張インスタンスはそれぞれ、ターゲットの類似性分布を維持しながらポジティブを対比する強い拡張インスタンスとペアになっています。実験結果は、提案されたSCEがさまざまなデータセットでベースラインのMoCov2およびReSSLを上回り、ImageNet線形評価プロトコルの最先端のアルゴリズムと競合することを示しています。

Contrastive representation learning has proven to be an effective self-supervised learning method. Most successful approaches are based on the Noise Contrastive Estimation (NCE) paradigm and consider different views of an instance as positives and other instances as noise that positives should be contrasted with. However, all instances in a dataset are drawn from the same distribution and share underlying semantic information that should not be considered as noise. We argue that a good data representation contains the relations, or semantic similarity, between the instances. Contrastive learning implicitly learns relations but considers the negatives as noise which is harmful to the quality of the learned relations and therefore the quality of the representation. To circumvent this issue we propose a novel formulation of contrastive learning using semantic similarity between instances called Similarity Contrastive Estimation (SCE). Our training objective can be considered as soft contrastive learning. Instead of hard classifying positives and negatives, we propose a continuous distribution to push or pull instances based on their semantic similarities. The target similarity distribution is computed from weak augmented instances and sharpened to eliminate irrelevant relations. Each weak augmented instance is paired with a strong augmented instance that contrasts its positive while maintaining the target similarity distribution. Experimental results show that our proposed SCE outperforms its baselines MoCov2 and ReSSL on various datasets and is competitive with state-of-the-art algorithms on the ImageNet linear evaluation protocol.

updated: Mon Nov 29 2021 15:19:15 GMT+0000 (UTC)

published: Mon Nov 29 2021 15:19:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト