Semi-supervised Contrastive Learning with Similarity Co-calibration

Yuhang Zhang; Xiaopeng Zhang; Robert. C. Qiu; Jie Li; Haohang Xu; Qi Tian

類似性協調キャリブレーションを使用した半教師あり対照学習

半教師あり学習は、ラベルのない大量のデータを活用する効果的な方法として機能します。この論文では、半教師あり学習（SsCL）と呼ばれる新しいトレーニング戦略を提案します。これは、自己教師あり学習でよく知られているコントラスト損失と半教師あり学習でのクロスエントロピー損失を組み合わせ、共同で最適化します。エンドツーエンドの方法で2つの目的。ハイライトは、同じモデルの重みで予測と再トレーニングを行う自己トレーニングベースの半教師あり学習とは異なり、SsCLは、2つのブランチ間でラベルのないデータの予測を交換し、共同キャリブレーション手順を作成することです。より良い予測に有益であり、極小値に閉じ込められることを回避します。この目標に向けて、対照損失ブランチは、クロスエントロピーブランチから生成された最も近い近傍を使用して、サンプル間のペアワイズ類似性をモデル化し、次に、対照的な類似性でクロスエントロピーブランチの予測分布を較正します。 SsCLがより識別力のある表現を生成し、少数のショット学習に有益であることを示します。特に、ResNet50をバックボーンとするImageNetでは、SsCLは1％と10％のラベル付きサンプルでそれぞれ60.2％と72.1％のトップ1精度を達成します。これはベースラインを大幅に上回り、以前の半教師ありおよび自己教師ありよりも優れています。メソッド。

Semi-supervised learning acts as an effective way to leverage massive unlabeled data. In this paper, we propose a novel training strategy, termed as Semi-supervised Contrastive Learning (SsCL), which combines the well-known contrastive loss in self-supervised learning with the cross entropy loss in semi-supervised learning, and jointly optimizes the two objectives in an end-to-end way. The highlight is that different from self-training based semi-supervised learning that conducts prediction and retraining over the same model weights, SsCL interchanges the predictions over the unlabeled data between the two branches, and thus formulates a co-calibration procedure, which we find is beneficial for better prediction and avoid being trapped in local minimum. Towards this goal, the contrastive loss branch models pairwise similarities among samples, using the nearest neighborhood generated from the cross entropy branch, and in turn calibrates the prediction distribution of the cross entropy branch with the contrastive similarity. We show that SsCL produces more discriminative representation and is beneficial to few shot learning. Notably, on ImageNet with ResNet50 as the backbone, SsCL achieves 60.2% and 72.1% top-1 accuracy with 1% and 10% labeled samples, respectively, which significantly outperforms the baseline, and is better than previous semi-supervised and self-supervised methods.

updated: Sun May 16 2021 09:13:56 GMT+0000 (UTC)

published: Sun May 16 2021 09:13:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト