Contrastive Learning Improves Model Robustness Under Label Noise

Aritra Ghosh; Andrew Lan

対照学習はラベルノイズの下でモデルのロバスト性を改善します

カテゴリクロスエントロピー（CCE）損失でトレーニングされたディープニューラルネットワークベースの分類器は、トレーニングデータのラベルノイズに敏感です。ラベルノイズの影響を軽減できる一般的なタイプの方法の1つは、監視ありの堅牢な方法と見なすことができます。 CCE損失を、ラベルノイズに対してロバストな損失に置き換えるか、トレーニングサンプルの重みを再設定して、損失値が高いサンプルの重みを減らすことができます。最近、半教師あり学習（SSL）を使用する別のタイプの方法が提案されました。これは、これらの教師ありロバストな方法を補強して、（おそらく）ノイズの多いサンプルをより効果的に活用します。監視された堅牢な方法は、さまざまなデータタイプで十分に機能しますが、ラベルノイズ下の画像分類タスクではSSL方法より劣ることが示されています。したがって、これらの監視されたロバストな方法が、ラベルのないサンプルをより効果的に利用できれば、うまく機能するかどうかはまだわかりません。この論文では、対照学習を通じて学習された表現を使用して教師ありロバスト手法を初期化することにより、ラベルノイズ下でのパフォーマンスが大幅に向上することを示します。驚くべきことに、最も単純な方法（CCE損失を使用して分類器をトレーニングする）でさえ、対照学習で初期化すると、高いラベルノイズの下で最先端のSSL方法を50％以上上回ることができます。私たちの実装はhttps://github.com/arghosh/noisy_label_pretrainで公開されます。

Deep neural network-based classifiers trained with the categorical cross-entropy (CCE) loss are sensitive to label noise in the training data. One common type of method that can mitigate the impact of label noise can be viewed as supervised robust methods; one can simply replace the CCE loss with a loss that is robust to label noise, or re-weight training samples and down-weight those with higher loss values. Recently, another type of method using semi-supervised learning (SSL) has been proposed, which augments these supervised robust methods to exploit (possibly) noisy samples more effectively. Although supervised robust methods perform well across different data types, they have been shown to be inferior to the SSL methods on image classification tasks under label noise. Therefore, it remains to be seen that whether these supervised robust methods can also perform well if they can utilize the unlabeled samples more effectively. In this paper, we show that by initializing supervised robust methods using representations learned through contrastive learning leads to significantly improved performance under label noise. Surprisingly, even the simplest method (training a classifier with the CCE loss) can outperform the state-of-the-art SSL method by more than 50% under high label noise when initialized with contrastive learning. Our implementation will be publicly available at https://github.com/arghosh/noisy_label_pretrain.

updated: Mon Apr 19 2021 00:27:58 GMT+0000 (UTC)

published: Mon Apr 19 2021 00:27:58 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト