Tuned Contrastive Learning

Chaitanya Animesh; Manmohan Chandraker

Contrastive learning based loss functions have recently become increasingly popular for visual self-supervised representation learning owing to their state-of-the-art (SOTA) performance. Most modern contrastive learning methods generalize only to one positive and multiple negatives per anchor. A recent state-of-the-art method, the supervised contrastive (SupCon) loss, extends self-supervised contrastive learning to the supervised setting by generalizing to multiple positives and negatives in a batch, and improves upon the cross-entropy loss. In this paper, we propose a novel contrastive loss function, the Tuned Contrastive Learning (TCL) loss, which generalizes to multiple positives and negatives in a batch and offers parameters to tune and improve the gradient responses from hard positives and hard negatives. We provide a theoretical analysis of our loss function's gradient response and show mathematically how it is better than that of the SupCon loss. We empirically compare our loss function with the SupCon loss and the cross-entropy loss in the supervised setting on multiple classification datasets to show its effectiveness. We also show the stability of our loss function across a range of hyper-parameter settings. Unlike the SupCon loss, which is applied only to the supervised setting, we show how to extend TCL to the self-supervised setting and empirically compare it with various SOTA self-supervised learning methods. Hence, we show that the TCL loss achieves performance on par with SOTA methods in both supervised and self-supervised settings.
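For readers unfamiliar with the baseline, the following is a minimal sketch of the SupCon loss that the abstract compares against, written in plain Python so it runs without dependencies. It illustrates what "multiple positives and negatives per anchor" means: every other sample in the batch with the same label as the anchor counts as a positive. The function names `l2_normalize` and `supcon_loss`, the default `temperature=0.1`, and the choice to average over anchors that have at least one positive are assumptions made for illustration. The TCL loss and its tuning parameters are not defined in this abstract, so they are not shown.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def supcon_loss(embeddings, labels, temperature=0.1):
    """Illustrative SupCon-style loss over one batch.

    embeddings: list of equal-length, non-zero float vectors.
    labels: list of class labels, one per embedding.
    Anchors with no same-label sample in the batch are skipped (assumption).
    """
    z = [l2_normalize(v) for v in embeddings]
    n = len(z)
    total, n_anchors = 0.0, 0
    for i in range(n):
        # Positives: every other sample sharing the anchor's label.
        positives = [p for p in range(n) if p != i and labels[p] == labels[i]]
        if not positives:
            continue
        # Scaled similarities between the anchor and all other samples.
        logits = {
            a: sum(x * y for x, y in zip(z[i], z[a])) / temperature
            for a in range(n)
            if a != i
        }
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in logits.values()))
        # Average negative log-probability over the anchor's positives.
        total += -sum(logits[p] - log_denom for p in positives) / len(positives)
        n_anchors += 1
    return total / n_anchors if n_anchors else 0.0


# Toy batch: two tight clusters in 2-D.
batch = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
loss_matching = supcon_loss(batch, [0, 0, 1, 1])    # labels follow the clusters
loss_mismatched = supcon_loss(batch, [0, 1, 0, 1])  # labels cut across clusters
```

In this toy batch the loss is small when the labels follow the clusters and large when they cut across them, which is the behaviour a contrastive objective is meant to reward.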

updated: Tue May 30 2023 05:00:37 GMT+0000 (UTC)

published: Thu May 18 2023 03:26:37 GMT+0000 (UTC)

arXiv
