Tuned Contrastive Learning

Chaitanya Animesh; Manmohan Chandraker

調整された対照学習

最近、対照学習ベースの損失関数は、その最先端 (SOTA) パフォーマンスのおかげで、視覚的な自己教師あり表現学習においてますます人気が高まっています。 SimCLR などの最新の対照学習損失関数のほとんどは Info-NCE ベースであり、アンカーごとに 1 つの正と複数の負にのみ一般化されます。最近の最先端の教師あり対比 (SupCon) 損失は、バッチ内の複数の正と複数の負を一般化することで、自己教師あり対比学習を教師あり設定に拡張し、クロスエントロピー損失を改善します。この論文では、バッチ内の複数のポジティブと複数のネガティブを一般化し、ハードポジティブとハードネガティブからの勾配応答を調整および改善するパラメーターを提供する、新しい対照損失関数である調整対照学習 (TCL) 損失を提案します。損失関数の勾配応答の理論的分析を提供し、それが SupCon 損失の応答よりもどのように優れているかを数学的に示します。経験的に、複数の分類タスクデータセットの教師あり設定で損失関数を SupCon 損失およびクロスエントロピー損失と比較します。また、さまざまなハイパーパラメーター設定に対する損失関数の安定性も示します。最後に、TCL をさまざまな SOTA 自己教師あり学習手法と比較し、損失関数が教師あり設定と自己教師あり学習の両方の設定で SOTA 手法と同等のパフォーマンスを達成することを示します。

In recent times, contrastive learning based loss functions have become increasingly popular for visual self-supervised representation learning owing to their state-of-the-art (SOTA) performance. Most of the modern contrastive learning loss functions like SimCLR are Info-NCE based and generalize only to one positive and multiple negatives per anchor. A recent state-of-the-art, supervised contrastive (SupCon) loss, extends self-supervised contrastive learning to supervised setting by generalizing to multiple positives and multiple negatives in a batch and improves upon the cross-entropy loss. In this paper, we propose a novel contrastive loss function - Tuned Contrastive Learning (TCL) loss, that generalizes to multiple positives and multiple negatives within a batch and offers parameters to tune and improve the gradient responses from hard positives and hard negatives. We provide theoretical analysis of our loss function's gradient response and show mathematically how it is better than that of SupCon loss. Empirically, we compare our loss function with SupCon loss and cross-entropy loss in a supervised setting on multiple classification-task datasets. We also show the stability of our loss function to various hyper-parameter settings. Finally, we compare TCL with various SOTA self-supervised learning methods and show that our loss function achieves performance on par with SOTA methods in both supervised and self-supervised settings.

updated: Thu May 18 2023 03:26:37 GMT+0000 (UTC)

published: Thu May 18 2023 03:26:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト