Revisiting Deep Local Descriptor for Improved Few-Shot Classification

Jun He; Richang Hong; Xueliang Liu; Mingliang Xu; Meng Wang

少数ショット分類を改善するためのディープローカル記述子の再検討

少数のショット分類は、少数のサポート画像に基づいて新しいクラスを理解するために深い学習者を迅速に適応させる問題を研究します。これに関連して、最近の研究努力は、クエリ画像とサポート画像の間の類似性を測定するますます複雑な分類器を設計することを目的としていますが、特徴の埋め込みの重要性はほとんど探求されていません。高度な分類器に依存する必要はなく、改善された特徴の埋め込みに直接適用される単純な分類器は、最先端の方法よりも優れている可能性があることを示します。この目的のために、DCAPという名前の新しい方法を紹介します。この方法では、密な分類と注意深いプーリングを活用して、埋め込みの品質を向上させる方法を調査します。具体的には、サンプルが豊富な基本クラスで学習者を事前トレーニングして、最初に密な分類問題を解決し、次にランダムにサンプリングされた一連の数ショットタスクで学習者を微調整して、数ショットのシーンリオまたはテストに適応させることを提案します。時間シーンリオ。広く使用されているグローバル平均プーリング（GAP）の代わりに注意深いプーリングを適用して、メタ微調整中の数ショット分類用の埋め込みを準備することにより、フィーチャマップをプールすることをお勧めします。注意深いプーリングは、ローカル記述子を再重み付けすることを学習し、学習者が意思決定の証拠として何を探しているかを説明します。 2つのベンチマークデータセットでの実験は、提案された方法が複数の数ショット設定で優れている一方で、より単純で説明しやすいことを示しています。コードはhttps://github.com/Ukeyboard/dcap/で入手できます。

Few-shot classification studies the problem of quickly adapting a deep learner to understanding novel classes based on few support images. In this context, recent research efforts have been aimed at designing more and more complex classifiers that measure similarities between query and support images, but left the importance of feature embeddings seldom explored. We show that the reliance on sophisticated classifier is not necessary and a simple classifier applied directly to improved feature embeddings can outperform state-of-the-art methods. To this end, we present a new method named DCAP in which we investigate how one can improve the quality of embeddings by leveraging Dense Classification and Attentive Pooling. Specifically, we propose to pre-train a learner on base classes with abundant samples to solve dense classification problem first and then fine-tune the learner on a bunch of randomly sampled few-shot tasks to adapt it to few-shot scenerio or the test time scenerio. We suggest to pool feature maps by applying attentive pooling instead of the widely used global average pooling (GAP) to prepare embeddings for few-shot classification during meta-finetuning. Attentive pooling learns to reweight local descriptors, explaining what the learner is looking for as evidence for decision making. Experiments on two benchmark datasets show the proposed method to be superior in multiple few-shot settings while being simpler and more explainable. Code is available at: https://github.com/Ukeyboard/dcap/.

updated: Tue Mar 30 2021 00:48:28 GMT+0000 (UTC)

published: Tue Mar 30 2021 00:48:28 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト