How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning

Yikai Wang; Li Zhang; Yuan Yao; Yanwei Fu

ラベルのないデータを信頼する方法は？少数ショット学習のためのインスタンスの信頼性推論

ディープラーニングベースのモデルは、多くのコンピュータービジョンタスクで優れており、人間のパフォーマンスを上回っているように見えます。ただし、これらのモデルでは、大量の人間がラベル付けしたトレーニングデータの雪崩と、多数のパラメーターをトレーニングするための多くの反復が必要です。これにより、スケーラビリティが実際のロングテール分散カテゴリに大幅に制限されます。その一部には多数のインスタンスがありますが、手動で注釈が付けられているのはごくわずかです。このような非常に限定されたラベル付きの例からの学習は、Few-shot Learning（FSL）として知られています。メタ学習またはデータ拡張戦略を活用してこの非常にデータが不足している問題を軽減する従来の技術とは異なり、このペーパーでは、数ショットの視覚認識のためにラベルなしインスタンスのサポートを活用するための統計的アプローチ、インスタンス信頼性推論（ICI）を紹介します。通常、独学の学習パラダイムを再利用して、数ショットからトレーニングされた初期分類器を使用してラベルなしインスタンスの疑似ラベルを予測し、最も信頼できるものを選択してトレーニングセットを拡張し、分類器を再トレーニングします。これは、付随的パラメーターを使用して（一般化）線形モデル（LM / GLM）を構築し、（非）ラベル付き特徴から（疑似）ラベルへのマッピングをモデル化することによって実現されます。ここで、偶発的パラメーターの希薄性は、対応する疑似ラベル付きインスタンス。対応する付随パラメータの正則化パスに沿って疑似ラベル付きインスタンスの信頼性をランク付けし、最も信頼できる疑似ラベル付きの例を拡張ラベル付きインスタンスとして保持します。理論的には、制限された固有値、表現不能、および大きなエラーの穏やかな条件下で、私たちのアプローチは、ノイズの多い疑似ラベル付きセットからすべての正しく予測されたインスタンスを収集することが保証されます。

Deep learning based models have excelled in many computer vision tasks and appear to surpass humans' performance. However, these models require an avalanche of expensive human labeled training data and many iterations to train their large number of parameters. This severely limits their scalability to the real-world long-tail distributed categories, some of which are with a large number of instances, but with only a few manually annotated. Learning from such extremely limited labeled examples is known as Few-shot learning (FSL). Different to prior arts that leverage meta-learning or data augmentation strategies to alleviate this extremely data-scarce problem, this paper presents a statistical approach, dubbed Instance Credibility Inference (ICI) to exploit the support of unlabeled instances for few-shot visual recognition. Typically, we repurpose the self-taught learning paradigm to predict pseudo-labels of unlabeled instances with an initial classifier trained from the few shot and then select the most confident ones to augment the training set to re-train the classifier. This is achieved by constructing a (Generalized) Linear Model (LM/GLM) with incidental parameters to model the mapping from (un-)labeled features to their (pseudo-)labels, in which the sparsity of the incidental parameters indicates the credibility of the corresponding pseudo-labeled instance. We rank the credibility of pseudo-labeled instances along the regularization path of their corresponding incidental parameters, and the most trustworthy pseudo-labeled examples are preserved as the augmented labeled instances. Theoretically, under mild conditions of restricted eigenvalue, irrepresentability, and large error, our approach is guaranteed to collect all the correctly-predicted instances from the noisy pseudo-labeled set.

updated: Tue May 11 2021 03:21:15 GMT+0000 (UTC)

published: Wed Jul 15 2020 03:38:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト