Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data

Shenwang Jiang; Jianan Li; Ying Wang; Bo Huang; Zhang Zhang; Tingfa Xu

Corrupted labels and class imbalance are common in real-world training data and easily lead to over-fitting of deep neural networks (DNNs). Existing approaches alleviate these issues with a sample re-weighting strategy, which re-weights samples using a designed weighting function. However, such approaches apply only to training data that contain a single type of data bias. In practice, samples with corrupted labels and samples from tail classes commonly co-exist in training data. How to handle them simultaneously is a key but under-explored problem. In this paper, we find that these two types of biased samples, although they have similar transient loss, show distinguishable trends and characteristics in their loss curves, which can provide valuable priors for sample weight assignment. Motivated by this, we delve into the loss curves and propose a novel probe-and-allocate training strategy. In the probing stage, we train the network on the whole biased training data without intervention and record the loss curve of each sample as an additional attribute. In the allocating stage, we feed the resulting attribute to a newly designed curve-perception network, named CurveNet, which learns through meta-learning to identify the bias type of each sample and adaptively assign proper weights. The slow training speed of meta-learning also hinders its application. To address this, we propose a method named skip layer meta optimization (SLMO), which accelerates training by skipping the bottom layers. Extensive experiments on synthetic and real-world data validate the proposed method, which achieves state-of-the-art performance on multiple challenging benchmarks.
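The probing stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a linear softmax classifier trained with full-batch gradient descent stands in for the DNN, the function name `probe_loss_curves` and the toy data are hypothetical, and CurveNet, the meta-learning step, and SLMO are not shown. The sketch only records one loss value per sample per epoch while training without any re-weighting.

```python
import numpy as np

def softmax(z):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def probe_loss_curves(X, y, num_classes, epochs=20, lr=0.1, seed=0):
    """Train a linear softmax classifier on (X, y) with no re-weighting and
    return per-sample loss curves of shape (n_samples, epochs).
    Hypothetical helper: the linear model is an assumption made for brevity."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = rng.normal(scale=0.01, size=(d, num_classes))
    b = np.zeros(num_classes)
    onehot = np.eye(num_classes)[y]
    curves = np.zeros((n, epochs))
    for t in range(epochs):
        p = softmax(X @ W + b)
        # Per-sample cross-entropy, recorded before this epoch's update.
        curves[:, t] = -np.log(p[np.arange(n), y] + 1e-12)
        # Full-batch gradient step on the unweighted mean loss.
        grad = (p - onehot) / n
        W -= lr * (X.T @ grad)
        b -= lr * grad.sum(axis=0)
    return curves

# Toy data (an assumption for illustration): two Gaussian blobs, with the
# labels of the first 10 samples of each class flipped to mimic corruption.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-2.0, 1.0, size=(100, 2)),
               rng.normal(2.0, 1.0, size=(100, 2))])
y_clean = np.array([0] * 100 + [1] * 100)
y = y_clean.copy()
flipped = np.r_[0:10, 100:110]
y[flipped] = 1 - y[flipped]

curves = probe_loss_curves(X, y, num_classes=2)
```

Each row of `curves` is the kind of additional per-sample attribute that the allocating stage would consume. In this toy setting the rows for flipped-label samples end at a higher loss than the rows for clean samples, which is a much cruder signal than the curve trends the paper exploits.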

updated: Thu Dec 30 2021 09:20:07 GMT+0000 (UTC)

published: Thu Dec 30 2021 09:20:07 GMT+0000 (UTC)

arXiv
