Safeguarded Dynamic Label Regression for Generalized Noisy Supervision

Jiangchao Yao; Ya Zhang; Ivor W. Tsang; Jun Sun

一般化されたノイズの多い監視のための保護された動的ラベル回帰

ビッグデータの時代には、正確な注釈にかかる費用のかかる労力を削減することを目的とした、ノイズの多いラベルを使用した学習が不可欠になっています。以前のノイズ遷移ベースの方法は、有望な結果を達成し、クラス条件付きノイズの場合のパフォーマンスに関する理論的保証を提示しました。ただし、このタイプのアプローチは、ノイズ遷移の正確な事前推定に大きく依存します。これは通常、実用的ではありません。その後の改善により、Softmaxレイヤーを介したトレーニングの進行状況とともに事前推定が適応されます。ただし、Softmax層のパラメーターは、不適切な確率的近似のために、脆弱なパフォーマンスのために大幅に調整されています。これらの問題に対処するために、ベイジアンフレームワークの下でノイズ遷移を自然に埋め込む潜在クラス条件付きノイズモデル（LCCN）を提案します。ノイズ遷移をディリクレ分布空間に投影することにより、学習は、一部のアドホックパラメトリック空間ではなく、データセット全体に基づくシンプレックスに制約されます。次に、LCCNの動的ラベル回帰法を推定して、潜在ラベルを繰り返し推測し、分類器を確率的にトレーニングし、ノイズをモデル化します。私たちのアプローチは、ノイズ遷移の制限された更新を保護します。これにより、サンプルのバッチを介した以前の任意の調整が回避されます。オープンセットのノイズの多いラベルと半教師あり設定のLCCNをさらに一般化します。制御可能なノイズデータセットであるCIFAR-10とCIFAR-100、および不可知論的なノイズデータセットであるClothing1MとWebVision17を使用して広範な実験を実行します。実験結果は、提案されたモデルがいくつかの最先端の方法よりも優れていることを示しています。

Learning with noisy labels, which aims to reduce expensive labors on accurate annotations, has become imperative in the Big Data era. Previous noise transition based method has achieved promising results and presented a theoretical guarantee on performance in the case of class-conditional noise. However, this type of approaches critically depend on an accurate pre-estimation of the noise transition, which is usually impractical. Subsequent improvement adapts the pre-estimation along with the training progress via a Softmax layer. However, the parameters in the Softmax layer are highly tweaked for the fragile performance due to the ill-posed stochastic approximation. To address these issues, we propose a Latent Class-Conditional Noise model (LCCN) that naturally embeds the noise transition under a Bayesian framework. By projecting the noise transition into a Dirichlet-distributed space, the learning is constrained on a simplex based on the whole dataset, instead of some ad-hoc parametric space. We then deduce a dynamic label regression method for LCCN to iteratively infer the latent labels, to stochastically train the classifier and to model the noise. Our approach safeguards the bounded update of the noise transition, which avoids previous arbitrarily tuning via a batch of samples. We further generalize LCCN for open-set noisy labels and the semi-supervised setting. We perform extensive experiments with the controllable noise data sets, CIFAR-10 and CIFAR-100, and the agnostic noise data sets, Clothing1M and WebVision17. The experimental results have demonstrated that the proposed model outperforms several state-of-the-art methods.

updated: Sat Aug 21 2021 12:31:49 GMT+0000 (UTC)

published: Wed Mar 06 2019 03:20:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト