Exploiting Class Similarity for Machine Learning with Confidence Labels and Projective Loss Functions

Gautam Rajendrakumar Gare; John Michael Galeotti

信頼ラベルと射影損失関数を使用した機械学習のためのクラス類似性の活用

機械学習に使用されるクラスラベルは相互に関連性があり、特定のクラスラベルは互いに類似しています（たとえば、猫と犬の画像は猫と車の画像よりも類似しています）。クラス間のこのような類似性は、モデルがクラス間で混乱するため、モデルのパフォーマンスが低下する原因となることがよくあります。現在のラベリング技術では、そのような類似性情報を明示的に取得できません。このホワイトペーパーでは、代わりに、新しい信頼ラベルを使用して類似性情報を取得することにより、クラス間の類似性を活用します。信頼性ラベルは、クラス間の類似性または混乱の可能性を示す確率的ラベルです。多くの場合、モデルが特徴空間のクラスを区別するようにトレーニングされた後でも、同様のクラスの潜在空間はクラスター化されたままです。このタイプのクラスタリングを貴重な情報と見なし、新しい射影損失関数で活用します。当社の射影損失関数は、同様のクラスを混乱させるエラーに対する損失ペナルティを緩和する機能を備えた信頼ラベルで機能するように設計されています。ノイズの多いラベルは、クラスの類似性から生じる混乱の結果であると考えられるため、ノイズのあるラベルを使用してニューラルネットワークをトレーニングするために私たちのアプローチを使用します。標準の損失関数を使用する場合と比較して、パフォーマンスが向上していることを示しています。 CIFAR-10データセットを使用して詳細な分析を行い、ImageNetやFood-101Nなどのより大きなデータセットへの提案された方法の適用性を示します。

Class labels used for machine learning are relatable to each other, with certain class labels being more similar to each other than others (e.g. images of cats and dogs are more similar to each other than those of cats and cars). Such similarity among classes is often the cause of poor model performance due to the models confusing between them. Current labeling techniques fail to explicitly capture such similarity information. In this paper, we instead exploit the similarity between classes by capturing the similarity information with our novel confidence labels. Confidence labels are probabilistic labels denoting the likelihood of similarity, or confusability, between the classes. Often even after models are trained to differentiate between classes in the feature space, the similar classes' latent space still remains clustered. We view this type of clustering as valuable information and exploit it with our novel projective loss functions. Our projective loss functions are designed to work with confidence labels with an ability to relax the loss penalty for errors that confuse similar classes. We use our approach to train neural networks with noisy labels, as we believe noisy labels are partly a result of confusability arising from class similarity. We show improved performance compared to the use of standard loss functions. We conduct a detailed analysis using the CIFAR-10 dataset and show our proposed methods' applicability to larger datasets, such as ImageNet and Food-101N.

updated: Thu Mar 25 2021 04:49:44 GMT+0000 (UTC)

published: Thu Mar 25 2021 04:49:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト