Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision

Yifan Zhang; Bryan Hooi; Lanqing Hong; Jiashi Feng

Existing long-tailed recognition methods, aiming to train class-balanced models from long-tailed data, generally assume the models would be evaluated on the uniform test class distribution. However, practical test class distributions often violate this assumption (e.g., being long-tailed or even inversely long-tailed), which would lead existing methods to fail in real-world applications. In this work, we study a more practical task setting, called test-agnostic long-tailed recognition, where the training class distribution is long-tailed while the test class distribution is unknown and can be skewed arbitrarily. In addition to the issue of class imbalance, this task poses another challenge: the class distribution shift between the training and test samples is unidentified. To handle this task, we propose a new method, called Test-time Aggregating Diverse Experts, that presents two solution strategies: (1) a new skill-diverse expert learning strategy that trains diverse experts to excel at handling different class distributions from a single long-tailed training distribution; (2) a novel test-time expert aggregation strategy that leverages self-supervision to aggregate multiple experts for handling various unknown test distributions. We theoretically show that our method has a provable ability to simulate the test class distribution. Extensive experiments verify that our method achieves new state-of-the-art performance on both vanilla and test-agnostic long-tailed recognition, where only three experts are sufficient to handle arbitrarily varied test class distributions. Code is available at https://github.com/Vanint/TADE-AgnosticLT.
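As a rough illustration of the second strategy, the sketch below mixes the outputs of three experts with weights chosen on unlabeled test data. The abstract does not specify the self-supervised objective, so the one used here (agreement between predictions on two perturbed views of the same samples) is an assumption, as are the grid search and all function names; it is not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def aggregate(expert_logits, weights):
    """Weighted sum of expert logits, turned into class probabilities.

    expert_logits: array of shape (E, N, C); weights: array of shape (E,).
    """
    mixed = np.tensordot(weights, expert_logits, axes=1)  # shape (N, C)
    return softmax(mixed, axis=-1)

def view_agreement(logits_a, logits_b, weights):
    """Label-free score (assumed objective): mean cosine similarity between
    the aggregated predictions on two views of the same test samples."""
    p = aggregate(logits_a, weights)
    q = aggregate(logits_b, weights)
    num = (p * q).sum(axis=1)
    den = np.linalg.norm(p, axis=1) * np.linalg.norm(q, axis=1)
    return float((num / den).mean())

def search_weights(logits_a, logits_b, steps=11):
    """Hypothetical helper: grid search over the weight simplex for exactly
    three experts, keeping the weights with the highest agreement score."""
    best_w, best_score = None, -np.inf
    grid = np.linspace(0.0, 1.0, steps)
    for w1 in grid:
        for w2 in grid:
            if w1 + w2 > 1.0 + 1e-9:
                continue
            w = np.array([w1, w2, max(0.0, 1.0 - w1 - w2)])
            score = view_agreement(logits_a, logits_b, w)
            if score > best_score:
                best_w, best_score = w, score
    return best_w, best_score

# Toy usage with random numbers standing in for real expert outputs.
rng = np.random.default_rng(0)
base = rng.normal(size=(3, 32, 10))                       # 3 experts, 32 samples, 10 classes
view_a = base + rng.normal(scale=0.1, size=base.shape)    # perturbed view 1
view_b = base + rng.normal(scale=0.1, size=base.shape)    # perturbed view 2
weights, score = search_weights(view_a, view_b)
probs = aggregate(base, weights)
```

No labels are used at any point: the weights depend only on how consistently the mixed prediction behaves across the two views. A gradient-based optimizer would be the more usual choice for learning the weights; the grid search is used here only to keep the sketch dependency-free.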

updated: Mon Nov 22 2021 11:37:21 GMT+0000 (UTC)

published: Tue Jul 20 2021 04:10:31 GMT+0000 (UTC)

arXiv
