The Resistance to Label Noise in K-NN and DNN Depends on its Concentration

Amnon Drory; Oria Ratzon; Shai Avidan; Raja Giryes

K-NNおよびDNNのラベルノイズに対する耐性は、その濃度に依存します

ラベルノイズが存在する場合のK最近傍法（K-NN）とディープニューラルネットワーク（DNN）の分類性能を調査します。最初に、特定のテスト例に対するDNNの予測が、その地域の近隣のトレーニング例のラベルに依存することを経験的に示します。これにより、独立して重要なラベルノイズが存在する場合のマルチクラスK-NN分類エラーを近似する実現可能な分析式を導出するようになります。次に、K-NNの式がDNNエラーの1次近似として機能する可能性があることを提案します。最後に、開発された式がK-NNおよびDNN分類器の観測されたパフォーマンスに近接していることを経験的に示します。私たちの結果は、いくつかのタイプのラベルノイズに対するDNNのすでに観察された驚くべき耐性を説明するかもしれません。また、ノイズが集中するほどパフォーマンスの低下が大きくなることを示す重要な要素を特徴づけます。

We investigate the classification performance of K-nearest neighbors (K-NN) and deep neural networks (DNNs) in the presence of label noise. We first show empirically that a DNN's prediction for a given test example depends on the labels of the training examples in its local neighborhood. This motivates us to derive a realizable analytic expression that approximates the multi-class K-NN classification error in the presence of label noise, which is of independent importance. We then suggest that the expression for K-NN may serve as a first-order approximation for the DNN error. Finally, we demonstrate empirically the proximity of the developed expression to the observed performance of K-NN and DNN classifiers. Our result may explain the already observed surprising resistance of DNN to some types of label noise. It also characterizes an important factor of it showing that the more concentrated the noise the greater is the degradation in performance.

updated: Thu Dec 03 2020 09:18:17 GMT+0000 (UTC)

published: Fri Mar 30 2018 11:06:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト