Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning

Xingping Dong; Shengcai Liao; Bo Du; Ling Shao

少数ショット学習のための擬似ラベリングに基づく実用的な半教師付きメタトレーニング

既存のほとんどの FSL (few-shot learning) メソッドでは、メタトレーニングに大量のラベル付きデータが必要であり、これが大きな制限となっています。ラベルの要件を減らすために、半教師付きメタトレーニング (SSMT) 設定が FSL 用に提案されています。これには、少数のラベル付きサンプルと、基本クラスのラベルなしサンプルの数のみが含まれます。ただし、この設定の既存のメソッドでは、ラベルのないセットからクラスを意識したサンプルを選択する必要があり、ラベルのないセットの仮定に違反しています。このホワイトペーパーでは、現実的なシナリオでの FSL の適用を容易にするために、真にラベル付けされていないデータを使用した実用的な半教師付きメタトレーニング設定を提案します。ラベル付けされたデータと真にラベル付けされていないデータの両方をより有効に活用するために、疑似ラベル付けベースのメタ学習 (PLML) と呼ばれるシンプルで効果的なメタトレーニングフレームワークを提案します。まず、一般的な半教師あり学習 (SSL) を介して分類器をトレーニングし、それを使用してラベルのないデータの疑似ラベルを取得します。次に、ラベル付けされたデータと疑似ラベル付けされたデータから少数ショットのタスクを構築し、ノイズラベルから FSL モデルをより適切に学習するために、特徴の平滑化とノイズ抑制を備えた新しい微調整方法を設計します。驚くべきことに、2 つの FSL データセットにわたる大規模な実験を通じて、この単純なメタトレーニングフレームワークが、限られたラベル付きデータの下でさまざまな FSL モデルのパフォーマンス低下を効果的に防ぎ、最先端の SSMT モデルよりも大幅に優れていることがわかりました。さらに、メタトレーニングの恩恵を受けて、私たちの方法は 2 つの代表的な SSL アルゴリズムも改善します。

Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL, which includes only a few labeled samples and numbers of unlabeled samples in base classes. However, existing methods under this setting require class-aware sample selection from the unlabeled set, which violates the assumption of unlabeled set. In this paper, we propose a practical semi-supervised meta-training setting with truly unlabeled data to facilitate the applications of FSL in realistic scenarios. To better utilize both the labeled and truly unlabeled data, we propose a simple and effective meta-training framework, called pseudo-labeling based meta-learning (PLML). Firstly, we train a classifier via common semi-supervised learning (SSL) and use it to obtain the pseudo-labels of unlabeled data. Then we build few-shot tasks from labeled and pseudo-labeled data and design a novel finetuning method with feature smoothing and noise suppression to better learn the FSL model from noise labels. Surprisingly, through extensive experiments across two FSL datasets, we find that this simple meta-training framework effectively prevents the performance degradation of various FSL models under limited labeled data, and also significantly outperforms the state-of-the-art SSMT models. Besides, benefiting from meta-training, our method also improves two representative SSL algorithms as well.

updated: Tue Mar 28 2023 16:34:49 GMT+0000 (UTC)

published: Thu Jul 14 2022 10:53:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト