SELF: Learning to Filter Noisy Labels with Self-Ensembling

Duc Tam Nguyen; Chaithanya Kumar Mummadi; Thi Phuong Nhung Ngo; Thi Hoai Phuong Nguyen; Laura Beggel; Thomas Brox


Deep neural networks (DNNs) have been shown to overfit a dataset when trained on noisy labels for long enough. To overcome this problem, we present self-ensemble label filtering (SELF), a simple and effective method that progressively filters out wrong labels during training. Our method improves task performance by gradually restricting supervision to the potentially non-noisy (clean) labels and by no longer learning from the labels it has filtered out as noisy. For the filtering, we form running averages of the network's predictions over the entire training dataset, using the network output at different training epochs. We show that these ensemble estimates identify inconsistent predictions throughout training more accurately than the single estimates of the network at the most recent training epoch. While filtered samples are removed entirely from the supervised training loss, we still leverage them dynamically through the unsupervised loss of a semi-supervised learning scheme. We demonstrate the positive effect of this approach on various image classification tasks under both symmetric and asymmetric label noise and at different noise ratios. The approach substantially outperforms all previous work on noise-aware learning across different datasets and can be applied to a broad set of network architectures.
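
The filtering step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the exponential form of the running average, the momentum value of 0.9, and the rule "keep a label only if the ensemble's top class agrees with it" are all assumptions made for illustration, and the probabilities are made-up toy numbers.

```python
from typing import List

Probs = List[List[float]]  # one row of class probabilities per training sample


def update_ensemble(ensemble: Probs, predictions: Probs, momentum: float = 0.9) -> Probs:
    """Blend this epoch's predictions into the running average (assumed exponential)."""
    return [
        [momentum * e + (1.0 - momentum) * p for e, p in zip(e_row, p_row)]
        for e_row, p_row in zip(ensemble, predictions)
    ]


def filter_labels(probs: Probs, labels: List[int]) -> List[bool]:
    """Return True for samples whose given label matches the top predicted class."""
    keep = []
    for row, label in zip(probs, labels):
        predicted = max(range(len(row)), key=row.__getitem__)
        keep.append(predicted == label)
    return keep


# Toy data: three samples, two classes. The third label is assumed to be wrong
# (the sample really belongs to class 1 but is annotated as class 0).
labels = [0, 1, 0]
epoch_predictions = [
    [[0.8, 0.2], [0.3, 0.7], [0.4, 0.6]],
    [[0.9, 0.1], [0.2, 0.8], [0.3, 0.7]],
    # In the last epoch the network starts to memorize the noisy third label.
    [[0.9, 0.1], [0.1, 0.9], [0.6, 0.4]],
]

# Initialize the ensemble with the first epoch, then fold in the later epochs.
ensemble = epoch_predictions[0]
for predictions in epoch_predictions[1:]:
    ensemble = update_ensemble(ensemble, predictions)

ensemble_mask = filter_labels(ensemble, labels)             # filtering on the running average
latest_mask = filter_labels(epoch_predictions[-1], labels)  # filtering on the last epoch only
print(ensemble_mask, latest_mask)
```

In this toy run the running average still rejects the third label, while the latest epoch alone would accept it. Samples with a `False` mask entry would be dropped from the supervised loss and used only in the unsupervised loss.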

updated: Fri Oct 04 2019 08:59:54 GMT+0000 (UTC)

published: Fri Oct 04 2019 08:59:54 GMT+0000 (UTC)

arXiv
