Solving The Long-Tailed Problem via Intra- and Inter-Category Balance

Renhui Zhang; Tiancheng Lin; Rui Zhang; Yi Xu

カテゴリ内およびカテゴリ間のバランスを介して長い尾の問題を解決する

視覚認識のベンチマークデータセットは、データが均一に分布していることを前提としていますが、実際のデータセットはロングテール分布に従います。現在のアプローチでは、ロングテールの問題を処理して、リサンプリングまたは再重み付け戦略によってロングテールのデータセットを一様分布に変換します。これらのアプローチはテールクラスを強調しますが、ヘッドクラスの難しい例を無視するため、パフォーマンスが低下します。この論文では、カテゴリ内およびカテゴリ間のバランス戦略によって対応して解決される、ロングテール問題の難易度とサンプルサイズの不均衡を分離するために、カテゴリごとの適応精度を備えた新しい勾配調和メカニズムを提案します。具体的には、カテゴリ内バランスは、各カテゴリの難しい例に焦点を当てて決定境界を最適化し、カテゴリ間バランスは、各カテゴリを1つの単位として、決定境界のシフトを修正することを目的としています。広範な実験は、提案された方法がすべてのデータセットで他のアプローチよりも一貫して優れていることを示しています。

Benchmark datasets for visual recognition assume that data is uniformly distributed, while real-world datasets obey long-tailed distribution. Current approaches handle the long-tailed problem to transform the long-tailed dataset to uniform distribution by re-sampling or re-weighting strategies. These approaches emphasize the tail classes but ignore the hard examples in head classes, which result in performance degradation. In this paper, we propose a novel gradient harmonized mechanism with category-wise adaptive precision to decouple the difficulty and sample size imbalance in the long-tailed problem, which are correspondingly solved via intra- and inter-category balance strategies. Specifically, intra-category balance focuses on the hard examples in each category to optimize the decision boundary, while inter-category balance aims to correct the shift of decision boundary by taking each category as a unit. Extensive experiments demonstrate that the proposed method consistently outperforms other approaches on all the datasets.

updated: Fri Apr 22 2022 05:59:47 GMT+0000 (UTC)

published: Wed Apr 20 2022 05:36:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト