FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling

Bowen Zhang; Yidong Wang; Wenxin Hou; Hao Wu; Jindong Wang; Manabu Okumura; Takahiro Shinozaki

The recently proposed FixMatch achieved state-of-the-art results on most semi-supervised learning (SSL) benchmarks. However, like other modern SSL algorithms, FixMatch uses a pre-defined constant threshold for all classes to select the unlabeled data that contribute to training, and so fails to account for the different learning statuses and learning difficulties of different classes. To address this issue, we propose Curriculum Pseudo Labeling (CPL), a curriculum learning approach that leverages unlabeled data according to the model's learning status. The core of CPL is to flexibly adjust the thresholds for different classes at each time step so that informative unlabeled data and their pseudo labels pass through. CPL does not introduce additional parameters or computations (forward or backward propagation). We apply CPL to FixMatch and call our improved algorithm FlexMatch. FlexMatch achieves state-of-the-art performance on a variety of SSL benchmarks, with especially strong performance when labeled data are extremely limited or when the task is challenging. For example, FlexMatch achieves 13.96% and 18.96% error rate reductions over FixMatch on the CIFAR-100 and STL-10 datasets, respectively, when there are only 4 labels per class. CPL also significantly boosts convergence speed: for example, FlexMatch can achieve better performance than FixMatch while using only 1/5 of its training time. Furthermore, we show that CPL can be easily adapted to other SSL algorithms and remarkably improves their performance. We open-source our code at https://github.com/TorchSSL/TorchSSL.
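The abstract describes per-class thresholds that follow the model's learning status but does not give the formula. The sketch below is one way such a scheme could look, not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the use of per-class counts of confidently predicted samples as the "learning status", the normalization by the largest count, the mapping x / (2 - x), and the base threshold of 0.95.

```python
from typing import Callable, List, Sequence, Tuple


def convex_mapping(x: float) -> float:
    """Assumed non-linear mapping from learning status in [0, 1] to a scale in [0, 1]."""
    return x / (2.0 - x)


def class_thresholds(
    learned_counts: Sequence[int],
    base_threshold: float = 0.95,
    mapping: Callable[[float], float] = convex_mapping,
) -> List[float]:
    """Scale a fixed base threshold per class (hypothetical helper).

    learned_counts[c] is assumed to hold how many unlabeled samples the model
    currently predicts as class c with high confidence.
    """
    peak = max(learned_counts)
    if peak == 0:
        # No evidence yet: fall back to the fixed threshold (an assumption).
        return [base_threshold] * len(learned_counts)
    # Normalize by the best-learned class so its threshold stays at the base value.
    return [mapping(count / peak) * base_threshold for count in learned_counts]


def select_pseudo_labels(
    probabilities: Sequence[Sequence[float]],
    thresholds: Sequence[float],
) -> List[Tuple[int, int]]:
    """Return (sample_index, pseudo_label) pairs that pass their class threshold."""
    selected = []
    for index, probs in enumerate(probabilities):
        label = max(range(len(probs)), key=lambda c: probs[c])
        if probs[label] >= thresholds[label]:
            selected.append((index, label))
    return selected


if __name__ == "__main__":
    thresholds = class_thresholds([100, 50, 0])
    predictions = [[0.96, 0.03, 0.01], [0.50, 0.40, 0.10], [0.20, 0.60, 0.20]]
    print(thresholds)
    print(select_pseudo_labels(predictions, thresholds))
```

Under these assumptions, a class the model has learned poorly gets a lower threshold, so more of its unlabeled samples contribute to training, while the best-learned class keeps the fixed threshold. The thresholds are derived from counts the training loop would already have, which is consistent with the claim that no extra forward or backward passes are needed.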

updated: Sun Jan 16 2022 07:37:14 GMT+0000 (UTC)

published: Fri Oct 15 2021 02:53:38 GMT+0000 (UTC)

arXiv
