Learning to Combat Noisy Labels via Classification Margins

Jason Z. Lin; Jelena Bradic

分類マージンを介してノイズの多いラベルと戦うことを学ぶ

ノイズの多いラベルでトレーニングされた深いニューラルネットワークは、クリーンなインスタンスとノイズの多いインスタンスを区別する能力をすぐに失うことが知られています。初期の学習フェーズが終了した後、ネットワークはノイズの多いインスタンスを記憶します。これにより、一般化のパフォーマンスが大幅に低下します。この問題を解決するために、ノイズの多いインスタンスの記憶が抑制される新しい堅牢な学習方法であるMARVEL（MARgins Via Early Learning）を提案します。分類マージンのエポック履歴に基づいて、すべてのインスタンスの「適合度」の良さを追跡する新しい検定統計量を提案します。連続する学習エポックのシーケンスでその分類マージンが小さい場合、そのインスタンスはノイズが多いと宣言され、ネットワークはそのインスタンスでの学習を放棄します。その結果、ネットワークは最初にノイズの可能性のあるインスタンスにフラグを立て、次にそのインスタンスでの学習が改善できるかどうかを確認するのを待ちます。改善できない場合、ネットワークはこのインスタンスを安全に破棄できることを確信して学習します。また、困難なインスタンスの重みを大きくすることができるMARVEL +を提案します。これにより、ネットワークはそれらに焦点を合わせて学習を改善し、その結果、一般化を行うことができます。合成ラベルノイズを含むベンチマークデータセットと実際のデータセットの実験結果は、MARVELがさまざまなノイズレベルで一貫して他のベースラインを上回り、非対称ノイズの下で大幅に大きなマージンがあることを示しています。

A deep neural network trained on noisy labels is known to quickly lose its power to discriminate clean instances from noisy ones. After the early learning phase has ended, the network memorizes the noisy instances, which leads to a significant degradation in its generalization performance. To resolve this issue, we propose MARVEL (MARgins Via Early Learning), a new robust learning method where the memorization of the noisy instances is curbed. We propose a new test statistic that tracks the goodness of "fit" of every instance based on the epoch-history of its classification margins. If its classification margin is small in a sequence of consecutive learning epochs, that instance is declared noisy and the network abandons learning on it. Consequently, the network first flags a possibly noisy instance, and then waits to see if learning on that instance can be improved and if not, the network learns with confidence that this instance can be safely abandoned. We also propose MARVEL+, where arduous instances can be upweighted, enabling the network to focus and improve its learning on them and consequently its generalization. Experimental results on benchmark datasets with synthetic label noise and real-world datasets show that MARVEL outperforms other baselines consistently across different noise levels, with a significantly larger margin under asymmetric noise.

updated: Thu Sep 02 2021 15:40:52 GMT+0000 (UTC)

published: Mon Feb 01 2021 10:35:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト