Probabilistic Contrastive Learning for Domain Adaptation

Junjie Li; Yixin Zhang; Zilei Wang; Keyu Tu

ドメイン適応のための確率的対照学習

標準的な対比学習は、ℓ_2 正規化を使用して抽出された特徴に作用します。ただし、ドメイン適応タスクの場合、標準パラダイムを使用した対照学習はうまく機能しないことがわかります。その主な理由は、最適化中にクラスの重み (最終的な全結合層の重み) が関与しないためです。これは、生成されたフィーチャがソースデータから学習したクラスの重みの周りにクラスター化されることを保証しません。この問題に取り組むために、このホワイトペーパーではシンプルかつ強力な確率的対比学習 (PCL) を提案します。これは、コンパクトな機能を生成するだけでなく、それらをクラスの重みの周りに分散するように強制します。具体的には、特徴を確率に置き換え、ℓ_2 正規化を削除することにより、従来の対照学習パラダイム (特徴 + ℓ_2 正規化) を破ります。このようにして、確率を適用してワンホットフォームに近づけることができるため、特徴とクラスの重みの間の距離を狭めることができます。 PCL は簡潔さのために汎用的であり、さまざまなタスクに使用できます。この論文では、教師なしドメイン適応 (UDA)、半教師ありドメイン適応 (SSDA)、半教師あり学習 (SSL)、UDA 検出、および UDA セマンティックセグメンテーションの 5 つのタスクについて広範な実験を行います。結果は、PCL がこれらのタスクに大きな利益をもたらすことができることを示しています。特に、セグメンテーションタスクの場合、PCL の恩恵を受けて、私たちの方法はより少ないトレーニングコストで CPSL-D を達成するか、それを上回ることさえあります (1*3090、5 日 vs 4*V100、11 日)。コードは https://github.com/ljjcoder/Probabilistic-Contrastive-Learning で入手できます。

The standard contrastive learning acts on the extracted features with ℓ_2 normalization. For domain adaptation tasks, however, we find that contrastive learning with the standard paradigm does not perform well. The reason is mainly that the class weights (weights of the final fully connected layer) are not involved during optimization, which does not guarantee the produced features to be clustered around the class weights learned from source data. To tackle this issue, we propose a simple yet powerful probabilistic contrastive learning (PCL) in this paper, which not only produces compact features but also enforces them to be distributed around the class weights. Specifically, we break the traditional contrastive learning paradigm (feature+ℓ_2 normalization) by replacing the features with probabilities and removing ℓ_2 normalization. In this way, we can enforce the probability to approximate the one-hot form, thereby narrowing the distance between the features and the class weights. PCL is generic due to conciseness, which can be used for different tasks. In this paper, we conduct extensive experiments on five tasks, i.e., unsupervised domain adaptation (UDA), semi-supervised domain adaptation (SSDA), semi-supervised learning (SSL), UDA detection, and UDA semantic segmentation. The results demonstrate that our PCL can bring significant gains for these tasks. In particular, for segmentation tasks, with the blessing of PCL, our method achieves or even surpasses CPSL-D with a smaller training cost (1*3090, 5 days vs 4*V100, 11 days). Code is available at https://github.com/ljjcoder/Probabilistic-Contrastive-Learning.

updated: Mon Nov 21 2022 12:57:26 GMT+0000 (UTC)

published: Thu Nov 11 2021 02:08:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト