Can Continual Learning Improve Long-Tailed Recognition? Toward a Unified Framework

Mahdiyar Molahasani; Michael Greenspan; Ali Etemad

継続的な学習はロングテール認識を向上させることができますか?統一されたフレームワークに向けて

ロングテール認識 (LTR) 問題は、異なるクラス間のサンプル数が大きく偏っている、非常に不均衡なデータセットから学習するというコンテキストで発生します。 LTR 手法は、より大きなヘッドセットとより小さなテールセットの両方を含むデータセットを正確に学習することを目的としています。損失関数の強い凸性の仮定の下で、完全なデータセットでトレーニングされた学習者の重みが、頭部で厳密にトレーニングされた同じ学習者の重みの上限内にあるという定理を提案します。次に、ヘッドとテールの学習を 2 つの別々の連続したステップとして扱うことにより、継続学習 (CL) メソッドは、ヘッドを忘れることなくテールを学習するように学習者の重みを効果的に更新できると主張します。まず、おもちゃの MNIST-LT データセットに対するさまざまな実験を行って理論的発見を検証します。次に、2 つの標準 LTR ベンチマーク (CIFAR100-LT および CIFAR10-LT) の不均衡な複数のバリエーションに対するいくつかの CL 戦略の有効性を評価し、標準 CL 手法がベースラインと比較して大幅なパフォーマンス向上を達成し、カスタマイズされたソリューションにアプローチできることを示します。 LTR用に作りました。また、自然に不均衡な Caltech256 データセットで CL を調査することで、現実世界のデータに対する CL 手法の適用可能性を評価し、最先端の分類器に対する CL の優位性を実証します。私たちの取り組みは、LTR と CL を統合するだけでなく、CL 手法の進歩を活用して LTR の課題に効果的に取り組むための道を切り開くものでもあります。

The Long-Tailed Recognition (LTR) problem emerges in the context of learning from highly imbalanced datasets, in which the number of samples among different classes is heavily skewed. LTR methods aim to accurately learn a dataset comprising both a larger Head set and a smaller Tail set. We propose a theorem where under the assumption of strong convexity of the loss function, the weights of a learner trained on the full dataset are within an upper bound of the weights of the same learner trained strictly on the Head. Next, we assert that by treating the learning of the Head and Tail as two separate and sequential steps, Continual Learning (CL) methods can effectively update the weights of the learner to learn the Tail without forgetting the Head. First, we validate our theoretical findings with various experiments on the toy MNIST-LT dataset. We then evaluate the efficacy of several CL strategies on multiple imbalanced variations of two standard LTR benchmarks (CIFAR100-LT and CIFAR10-LT), and show that standard CL methods achieve strong performance gains in comparison to baselines and approach solutions that have been tailor-made for LTR. We also assess the applicability of CL techniques on real-world data by exploring CL on the naturally imbalanced Caltech256 dataset and demonstrate its superiority over state-of-the-art classifiers. Our work not only unifies LTR and CL but also paves the way for leveraging advances in CL methods to tackle the LTR challenge more effectively.

updated: Fri Jun 23 2023 03:05:33 GMT+0000 (UTC)

published: Fri Jun 23 2023 03:05:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト