Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

Erik Englesson; Hossein Azizpour

ノイズの多いラベルを使用した学習のための一般化されたイェンセンシャノン発散損失

以前の研究では、平均絶対誤差（MAE）などのノイズに強い損失関数を標準のカテゴリ損失関数（クロスエントロピー（CE）など）と組み合わせて学習性を向上させることが有益であることがわかっています。ここでは、イェンセン・シャノン発散をノイズロバストな損失関数として使用することを提案し、制御可能な混合パラメーターを使用してCEとMAEの間を興味深い補間することを示します。さらに、CEはノイズの多いデータポイントの周囲で一貫性が低いことを示す重要な観察を行います。この観察に基づいて、データポイント周辺の一貫性を促進するために、複数の分布に対してイェンセンシャノン発散の一般化バージョンを採用します。この損失関数を使用して、合成（CIFAR）ノイズと実世界（WebVisionなど）の両方のノイズについて、さまざまなノイズレートで最新の結果を示します。

Prior works have found it beneficial to combine provably noise-robust loss functions e.g., mean absolute error (MAE) with standard categorical loss function e.g. cross entropy (CE) to improve their learnability. Here, we propose to use Jensen-Shannon divergence as a noise-robust loss function and show that it interestingly interpolate between CE and MAE with a controllable mixing parameter. Furthermore, we make a crucial observation that CE exhibit lower consistency around noisy data points. Based on this observation, we adopt a generalized version of the Jensen-Shannon divergence for multiple distributions to encourage consistency around data points. Using this loss function, we show state-of-the-art results on both synthetic (CIFAR), and real-world (e.g., WebVision) noise with varying noise rates.

updated: Fri Oct 29 2021 06:26:48 GMT+0000 (UTC)

published: Mon May 10 2021 17:19:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト