On the Importance of Distractors for Few-Shot Classification

Rajshekhar Das; Yu-Xiong Wang; JoséM. F. Moura

少数ショット分類のためのディストラクタの重要性について

少数のショットの分類は、ラベル付けされたいくつかの例（通常、1〜5）から学習することにより、新しいタスクのカテゴリを分類することを目的としています。数ショット分類への効果的なアプローチには、大規模サンプルのベースドメインでトレーニングされた事前モデルが含まれます。このモデルは、新しい数ショットタスクで微調整され、一般化可能な表現が生成されます。ただし、十分なトレーニング例がないため、タスク固有の微調整は過剰適合する傾向があります。この問題を軽減するために、ベースドメインからのラベルのない例をディストラクタの形で再利用する対照学習に基づく新しい微調整アプローチを提案します。以前の作品で使用されたラベルのないデータの性質とは異なり、気を散らすものは、新しいカテゴリと重複しないクラスに属しています。そのような気晴らしを含めることで、数ショットの一般化を大幅に促進できることを初めて示します。私たちの技術的な目新しさには、数ショットのタスクで同じカテゴリを共有する例の確率的ペアと、タスク固有のネガティブとディストラクタの相対的な影響を制御する重み付け項が含まれます。微調整の目的の重要な側面は、ディストラクタラベルに依存しないため、さまざまなベースドメイン設定に適用できることです。最先端のアプローチと比較して、私たちの方法は、クロスドメインで最大12％、教師なし事前学習設定で最大5％の精度の向上を示しています。

Few-shot classification aims at classifying categories of a novel task by learning from just a few (typically, 1 to 5) labelled examples. An effective approach to few-shot classification involves a prior model trained on a large-sample base domain, which is then finetuned over the novel few-shot task to yield generalizable representations. However, task-specific finetuning is prone to overfitting due to the lack of enough training examples. To alleviate this issue, we propose a new finetuning approach based on contrastive learning that reuses unlabelled examples from the base domain in the form of distractors. Unlike the nature of unlabelled data used in prior works, distractors belong to classes that do not overlap with the novel categories. We demonstrate for the first time that inclusion of such distractors can significantly boost few-shot generalization. Our technical novelty includes a stochastic pairing of examples sharing the same category in the few-shot task and a weighting term that controls the relative influence of task-specific negatives and distractors. An important aspect of our finetuning objective is that it is agnostic to distractor labels and hence applicable to various base domain settings. Compared to state-of-the-art approaches, our method shows accuracy gains of up to 12% in cross-domain and up to 5% in unsupervised prior-learning settings.

updated: Mon Sep 20 2021 23:35:56 GMT+0000 (UTC)

published: Mon Sep 20 2021 23:35:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト