Subclass-balancing Contrastive Learning for Long-tailed Recognition

Chengkai Hou; Jieyu Zhang; Haonan Wang; Tianyi Zhou

ロングテール認識のためのサブクラスバランシング対照学習

実際の機械学習アプリケーションでは、クラス分布が不均衡なロングテール認識が自然に発生します。データの再重み付け、リサンプリング、教師あり対比学習などの既存の手法は、ヘッドクラスとテールクラスのインスタンス間に不均衡をもたらすという代償を払ってクラスのバランスを強制します。これにより、前者の基礎となる豊富な意味論的な下部構造が無視され、後者のバイアスが誇張される可能性があります。。私たちは、各先頭クラスを末尾クラスと同様のサイズの複数のサブクラスにクラスタリングし、元のクラス間の 2 層のクラス階層を捕捉するための表現を強制する、新しい「サブクラスバランシング対比学習 (SBCL)」アプローチによってこれらの欠点を克服します。およびそのサブクラス。クラスタリングは表現空間で実行され、トレーニング中に更新されるため、サブクラスラベルはヘッドクラスの意味論的な下部構造を保持します。一方、末尾クラスのサンプルを過度に強調することはないため、個々のインスタンスは表現の学習に均等に寄与します。したがって、私たちの方法はインスタンスとサブクラスの両方のバランスを達成し、元のクラスラベルも異なるクラスのサブクラス間の対比学習を通じて学習します。ロングテールベンチマークデータセットのリストに基づいて SBCL を評価し、最先端のパフォーマンスを達成しました。さらに、SBCL の利点を検証するために、SBCL の広範な分析とアブレーション研究を紹介します。

Long-tailed recognition with imbalanced class distribution naturally emerges in practical machine learning applications. Existing methods such as data reweighing, resampling, and supervised contrastive learning enforce the class balance with a price of introducing imbalance between instances of head class and tail class, which may ignore the underlying rich semantic substructures of the former and exaggerate the biases in the latter. We overcome these drawbacks by a novel ``subclass-balancing contrastive learning (SBCL)'' approach that clusters each head class into multiple subclasses of similar sizes as the tail classes and enforce representations to capture the two-layer class hierarchy between the original classes and their subclasses. Since the clustering is conducted in the representation space and updated during the course of training, the subclass labels preserve the semantic substructures of head classes. Meanwhile, it does not overemphasize tail class samples, so each individual instance contribute to the representation learning equally. Hence, our method achieves both the instance- and subclass-balance, while the original class labels are also learned through contrastive learning among subclasses from different classes. We evaluate SBCL over a list of long-tailed benchmark datasets and it achieves the state-of-the-art performance. In addition, we present extensive analyses and ablation studies of SBCL to verify its advantages.

updated: Wed Jun 28 2023 05:08:43 GMT+0000 (UTC)

published: Wed Jun 28 2023 05:08:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト