Positive Label Is All You Need for Multi-Label Classification

Zhixiang Yuan; Kaixin Zhang; Tao Huang

Multi-label classification (MLC) suffers from inevitable label noise in the training data because annotating every semantic label in each image is difficult. To mitigate the influence of noisy labels, existing methods mainly focus on identifying and correcting label mistakes with a trained MLC model. However, these methods still include the noisy labels during training, which can make the identification of noisy labels imprecise and weaken performance. In this paper, considering that negative labels substantially outnumber positive labels and that most noisy labels come from the negative labels, we directly discard all negative labels in the dataset and propose a new method dubbed positive and unlabeled multi-label classification (PU-MLC). By extending positive-unlabeled learning to the MLC task, our method trains a model with only positive labels and unlabeled data, and introduces an adaptive re-balance factor and an adaptive temperature coefficient in the loss function to alleviate the catastrophic imbalance in label distribution and the over-smoothing of probabilities in training. Our PU-MLC is simple and effective, and it is applicable to both MLC and MLC with partial labels (MLC-PL) tasks. Extensive experiments on the MS-COCO and PASCAL VOC datasets demonstrate that PU-MLC achieves significant improvements in both the MLC and MLC-PL settings with even fewer annotations. Code will be released.
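
The abstract does not state the loss function, so the following is only a rough sketch of the general idea of training each class from positive and unlabeled entries. It swaps in the non-negative PU risk estimator from the general positive-unlabeled learning literature, applied independently per class. The function name `pu_multilabel_loss`, the fixed `class_prior`, and the fixed `temperature` are assumptions made for illustration; they are not the paper's adaptive re-balance factor or adaptive temperature coefficient.

```python
import math


def _sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def _mean(values):
    # Mean that returns 0.0 for an empty list (class with no such entries).
    return sum(values) / len(values) if values else 0.0


def pu_multilabel_loss(logits, positive_mask, class_prior, temperature=1.0):
    """Hypothetical per-class non-negative PU loss, averaged over classes.

    logits:        list of rows, one raw score per class
    positive_mask: same shape; 1 = known positive, 0 = unlabeled
                   (negative labels are never consulted)
    class_prior:   assumed fraction of positives for each class
    temperature:   assumed fixed scalar dividing the logits
    """
    eps = 1e-12
    num_classes = len(class_prior)
    total = 0.0
    for c in range(num_classes):
        pos_as_pos, pos_as_neg, unl_as_neg = [], [], []
        for row, mask in zip(logits, positive_mask):
            p = _sigmoid(row[c] / temperature)
            if mask[c]:
                pos_as_pos.append(-math.log(p + eps))
                pos_as_neg.append(-math.log(1.0 - p + eps))
            else:
                unl_as_neg.append(-math.log(1.0 - p + eps))
        # Risk on labeled positives, weighted by the assumed class prior.
        risk_pos = class_prior[c] * _mean(pos_as_pos)
        # Estimated negative risk from unlabeled data, clipped at zero.
        risk_neg = max(
            _mean(unl_as_neg) - class_prior[c] * _mean(pos_as_neg), 0.0
        )
        total += risk_pos + risk_neg
    return total / num_classes
```

Under these assumptions, scores that rank the known positives high and the unlabeled entries low should yield a smaller loss than the reversed scores, even though no negative label is ever read.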

updated: Wed Jun 28 2023 08:44:00 GMT+0000 (UTC)

published: Wed Jun 28 2023 08:44:00 GMT+0000 (UTC)

arXiv
