MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

Qihao Zhao; Chen Jiang; Wei Hu; Fan Zhang; Jun Liu

MDCS: 一貫性のあるより多様な専門家による長期的な認識のための自己蒸留

最近、複数の専門家による手法により、ロングテール認識 (LTR) が大幅に向上しました。 LTR 向上に貢献するためにさらなる強化が必要な 2 つの側面を要約します。(1) より多様な専門家。 (2) 下位モデルの差異。ただし、以前の方法ではそれらをうまく処理できませんでした。この目的を達成するために、私たちは、以前の方法によって残されたギャップを埋めるために、一貫性自己蒸留 (MDCS) を備えたより多様な専門家を提案します。当社の MDCS アプローチは、多様性損失 (DL) と一貫性自己蒸留 (CS) という 2 つのコアコンポーネントで構成されています。詳細には、DL はさまざまなカテゴリへの焦点を制御することで、専門家の多様性を促進します。モデルの分散を減らすために、KL 発散を使用して、専門家の自己蒸留のために弱く拡張されたインスタンスのより豊富な知識を蒸留します。特に、偏った/ノイズの多い知識を回避するために、CS 用に正しく分類されたインスタンスを選択するための Confident Instance Sampling (CIS) を設計します。分析とアブレーションの研究では、以前の研究と比較して、私たちの方法が専門家の多様性を効果的に高め、モデルの分散を大幅に減少させ、認識精度を向上させることができることを実証します。さらに、DL と CS の役割は相互に強化し、連携しています。専門家の多様性は CS の恩恵を受けますが、CS は DL なしでは目覚ましい成果を達成できません。実験によると、当社の MDCS は、CIFAR10-LT、CIFAR100-LT、ImageNet-LT、Places-LT、iNaturalist 2018 を含む 5 つの一般的なロングテールベンチマークで最先端のベンチマークを 1% ～ 2% 上回るパフォーマンスを示しています。 https://github.com/fistyee/MDCS で入手できます。

Recently, multi-expert methods have led to significant improvements in long-tail recognition (LTR). We summarize two aspects that need further enhancement to contribute to LTR boosting: (1) More diverse experts; (2) Lower model variance. However, the previous methods didn't handle them well. To this end, we propose More Diverse experts with Consistency Self-distillation (MDCS) to bridge the gap left by earlier methods. Our MDCS approach consists of two core components: Diversity Loss (DL) and Consistency Self-distillation (CS). In detail, DL promotes diversity among experts by controlling their focus on different categories. To reduce the model variance, we employ KL divergence to distill the richer knowledge of weakly augmented instances for the experts' self-distillation. In particular, we design Confident Instance Sampling (CIS) to select the correctly classified instances for CS to avoid biased/noisy knowledge. In the analysis and ablation study, we demonstrate that our method compared with previous work can effectively increase the diversity of experts, significantly reduce the variance of the model, and improve recognition accuracy. Moreover, the roles of our DL and CS are mutually reinforcing and coupled: the diversity of experts benefits from the CS, and the CS cannot achieve remarkable results without the DL. Experiments show our MDCS outperforms the state-of-the-art by 1% ∼ 2% on five popular long-tailed benchmarks, including CIFAR10-LT, CIFAR100-LT, ImageNet-LT, Places-LT, and iNaturalist 2018. The code is available at https://github.com/fistyee/MDCS.

updated: Thu Nov 30 2023 11:58:12 GMT+0000 (UTC)

published: Sat Aug 19 2023 06:21:22 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト