DBN-Mix: Training Dual Branch Network Using Bilateral Mixup Augmentation for Long-Tailed Visual Recognition

Jae Soon Baik; In Young Yoon; Jun Won Choi

DBN-Mix: ロングテール視覚認識のためのバイラテラルミックスアップ増強を使用したデュアルブランチネットワークのトレーニング

ロングテールクラス分布から学習するという挑戦的な視覚認識タスクへの関心が高まっています。トレーニングデータセットの極端なクラスの不均衡により、少数派クラスのデータよりも多数派クラスのデータを認識することを優先するようにモデルにバイアスがかかります。さらに、マイノリティクラスのサンプルに多様性がないため、適切な表現を見つけることが難しくなっています。この論文では、バイラテラルミックスアップ増強と呼ばれる効果的なデータ増強方法を提案します。これは、ロングテール視覚認識のパフォーマンスを向上させることができます。バイラテラルミックスアップ拡張は、均一サンプラーと再調整されたサンプラーによって生成された 2 つのサンプルを結合し、トレーニングデータセットを拡張して、マイノリティクラスの表現学習を強化します。また、クラスごとの温度スケーリングを使用して分類子のバイアスを減らします。これにより、トレーニングフェーズでクラスごとに異なる方法でロジットがスケーリングされます。両方のアイデアをデュアルブランチネットワーク (DBN) フレームワークに適用し、バイラテラルミックスアップを伴うデュアルブランチネットワーク (DBN-Mix) という名前の新しいモデルを提示します。一般的なロングテール視覚認識データセットでの実験では、DBN-Mix がベースラインよりもパフォーマンスを大幅に改善し、提案された方法がベンチマークのいくつかのカテゴリで最先端のパフォーマンスを達成することが示されています。

There is growing interest in the challenging visual perception task of learning from long-tailed class distributions. The extreme class imbalance in the training dataset biases the model to prefer recognizing majority class data over minority class data. Furthermore, the lack of diversity in minority class samples makes it difficult to find a good representation. In this paper, we propose an effective data augmentation method, referred to as bilateral mixup augmentation, which can improve the performance of long-tailed visual recognition. The bilateral mixup augmentation combines two samples generated by a uniform sampler and a re-balanced sampler and augments the training dataset to enhance the representation learning for minority classes. We also reduce the classifier bias using class-wise temperature scaling, which scales the logits differently per class in the training phase. We apply both ideas to the dual-branch network (DBN) framework, presenting a new model, named dual-branch network with bilateral mixup (DBN-Mix). Experiments on popular long-tailed visual recognition datasets show that DBN-Mix improves performance significantly over baseline and that the proposed method achieves state-of-the-art performance in some categories of benchmarks.

updated: Sat Aug 20 2022 06:32:19 GMT+0000 (UTC)

published: Tue Jul 05 2022 17:01:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト