Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision

Yifan Zhang; Bryan Hooi; Lanqing Hong; Jiashi Feng

Existing long-tailed recognition methods, which aim to train class-balanced models from long-tailed data, generally assume that the models will be evaluated on a uniform test class distribution. In practice, however, the test class distribution often violates this assumption (e.g., it may be long-tailed or even inversely long-tailed), which can cause existing methods to fail in real-world applications. In this work, we study a more practical task setting, called test-agnostic long-tailed recognition, where the training class distribution is long-tailed while the test class distribution is unknown and can be skewed arbitrarily. In addition to the issue of class imbalance, this task poses another challenge: the class distribution shift between the training and test samples is unidentified. To address this task, we propose a new method, called Test-time Aggregating Diverse Experts (TADE), that comprises two solution strategies: (1) a novel skill-diverse expert learning strategy that trains diverse experts, from a single long-tailed training distribution, to excel at handling different test distributions; (2) a novel test-time expert aggregation strategy that leverages self-supervision to aggregate multiple experts for handling various test distributions. Moreover, we theoretically show that our method has a provable ability to simulate unknown test class distributions. Promising results on both vanilla and test-agnostic long-tailed recognition verify the effectiveness of TADE. Code is available at https://github.com/Vanint/TADE-AgnosticLT.
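To make the second strategy concrete, the following is a minimal sketch of test-time expert aggregation. The abstract does not state the self-supervised objective, so the sketch assumes one plausible choice: the aggregation weights are tuned, without labels, to maximize the agreement (cosine similarity) between the aggregated predictions on two augmented views of each test sample. All function names, the toy logits, and the finite-difference optimizer are hypothetical illustrations and not the authors' implementation, which is in the repository linked above.

```python
import math

def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate(expert_logits, weights):
    """Mix per-expert logits with the given weights, then map to probabilities."""
    num_classes = len(expert_logits[0])
    mixed = [sum(w * logits[c] for w, logits in zip(weights, expert_logits))
             for c in range(num_classes)]
    return softmax(mixed)

def cosine(p, q):
    """Cosine similarity between two probability vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q)

def agreement(weights, view_a, view_b):
    """Mean agreement between aggregated predictions on two views (no labels used)."""
    sims = [cosine(aggregate(a, weights), aggregate(b, weights))
            for a, b in zip(view_a, view_b)]
    return sum(sims) / len(sims)

def learn_weights(view_a, view_b, steps=200, lr=0.5, eps=1e-4):
    """Gradient ascent on the agreement score.

    Assumption: weights are parameterized as softmax(params) and the gradient
    is estimated by central finite differences to keep the sketch dependency-free.
    """
    num_experts = len(view_a[0])
    params = [0.0] * num_experts  # start from uniform weights
    for _ in range(steps):
        grad = []
        for k in range(num_experts):
            up = list(params)
            up[k] += eps
            down = list(params)
            down[k] -= eps
            diff = (agreement(softmax(up), view_a, view_b)
                    - agreement(softmax(down), view_a, view_b))
            grad.append(diff / (2 * eps))
        params = [p + lr * g for p, g in zip(params, grad)]
    return softmax(params)

# Hypothetical toy data: one test sample, two experts, two classes.
# Indexing is view[sample][expert][class].
# Expert 0 predicts the same class under both views; expert 1 flips.
view_a = [[[2.0, 0.0], [0.0, 3.0]]]
view_b = [[[2.0, 0.0], [3.0, 0.0]]]

learned = learn_weights(view_a, view_b)
print("learned weights:", learned)
print("agreement (uniform):", agreement([0.5, 0.5], view_a, view_b))
print("agreement (learned):", agreement(learned, view_a, view_b))
```

In this toy setup the expert whose predictions are stable across views receives the larger weight. In a real pipeline the logits would come from trained experts applied to augmented test images, and an automatic-differentiation framework would replace the finite-difference gradient.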

updated: Tue Jul 20 2021 04:10:31 GMT+0000 (UTC)

published: Tue Jul 20 2021 04:10:31 GMT+0000 (UTC)

arXiv
