PLM: Partial Label Masking for Imbalanced Multi-label Classification

Kevin Duarte; Yogesh S. Rawat; Mubarak Shah

PLM：不均衡なマルチラベル分類のための部分的なラベルマスキング

ロングテールのラベル分布を持つ実世界のデータセットでトレーニングされたニューラルネットワークは、頻繁なクラスに偏っており、まれなクラスではパフォーマンスが低下します。各クラスの正と負のサンプルの比率の不均衡は、ネットワーク出力確率をグラウンドトゥルース分布からさらに歪めます。トレーニング中にこの比率を利用する方法、部分ラベルマスキング（PLM）を提案します。損失計算中にラベルを確率的にマスキングすることにより、このメソッドはクラスごとにこの比率のバランスを取り、少数派クラスの想起を改善し、頻繁なクラスの精度を改善します。この比率は、予測された分布とグラウンドトゥルース分布の間のKL発散を最小化することにより、ネットワークのパフォーマンスに基づいて適応的に推定されます。データの不均衡に対処する既存のアプローチのほとんどは、主にシングルラベル分類に焦点を当てており、マルチラベルの場合にはあまり一般化されていませんが、この作業では、マルチラベル分類のロングテールデータ不均衡の問題を解決するための一般的なアプローチを提案します。 PLMは用途が広く、ほとんどの目的関数に適用でき、クラスの不均衡に対する他の戦略と一緒に使用できます。私たちの方法は、マルチラベル（MultiMNISTとMSCOCO）とシングルラベル（不均衡なCIFAR-10とCIFAR-100）の両方の画像分類データセットで既存の方法と比較して強力なパフォーマンスを実現します。

Neural networks trained on real-world datasets with long-tailed label distributions are biased towards frequent classes and perform poorly on infrequent classes. The imbalance in the ratio of positive and negative samples for each class skews network output probabilities further from ground-truth distributions. We propose a method, Partial Label Masking (PLM), which utilizes this ratio during training. By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes. The ratio is estimated adaptively based on the network's performance by minimizing the KL divergence between predicted and ground-truth distributions. Whereas most existing approaches addressing data imbalance are mainly focused on single-label classification and do not generalize well to the multi-label case, this work proposes a general approach to solve the long-tail data imbalance issue for multi-label classification. PLM is versatile: it can be applied to most objective functions and it can be used alongside other strategies for class imbalance. Our method achieves strong performance when compared to existing methods on both multi-label (MultiMNIST and MSCOCO) and single-label (imbalanced CIFAR-10 and CIFAR-100) image classification datasets.

updated: Sat May 22 2021 18:07:56 GMT+0000 (UTC)

published: Sat May 22 2021 18:07:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト