ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot

Jiarui Cai; Yizhou Wang; Jenq-Neng Hwang

One-stage long-tailed recognition methods improve overall performance in a "seesaw" manner: they either sacrifice head accuracy for better tail classification, or raise head accuracy even higher while ignoring the tail. Existing algorithms bypass this trade-off with a multi-stage training process: pre-training on an imbalanced set and fine-tuning on a balanced set. Although these methods achieve promising performance, they are sensitive to the generalizability of the pre-trained model, and they are not easily integrated into other computer vision tasks such as detection and segmentation, where pre-training the classifier alone is not applicable. In this paper, we propose a one-stage long-tailed recognition scheme, ally complementary experts (ACE), in which each expert is the most knowledgeable specialist on the subset of categories that dominates its training, and complements the other experts on the less-seen categories without being disturbed by categories it has never seen. We design a distribution-adaptive optimizer that adjusts the learning pace of each expert to avoid over-fitting. Without bells and whistles, the vanilla ACE outperforms the current one-stage SOTA method by 3-10% on the CIFAR10-LT, CIFAR100-LT, ImageNet-LT and iNaturalist datasets. ACE is also shown to be the first method to break the "seesaw" trade-off, improving the accuracy of the majority and minority categories simultaneously in a single stage. Code and trained models are available at https://github.com/jrcai/ACE.

updated: Thu Aug 05 2021 05:31:57 GMT+0000 (UTC)

published: Thu Aug 05 2021 05:31:57 GMT+0000 (UTC)

arXiv
