Synthetic Data Can Also Teach: Synthesizing Effective Data for Unsupervised Visual Representation Learning

Yawen Wu; Zhepeng Wang; Dewen Zeng; Yiyu Shi; Jingtong Hu

合成データからも学べる: 教師なし視覚表現学習のための効果的なデータの合成

自己教師あり学習アプローチである対照学習 (CL) は、ラベル付けされていないデータから視覚的表現を効果的に学習できます。 CL トレーニングデータが与えられると、生成モデルをトレーニングして合成データを生成し、実際のデータを補足することができます。 CL トレーニングに合成データと実際のデータの両方を使用すると、学習した表現の品質が向上する可能性があります。ただし、合成データは通常、実際のデータよりも品質が低く、合成データを使用しても、実際のデータを使用した場合と比較して CL が改善されない場合があります。この問題に取り組むために、共同サンプル生成と対照学習によってCLトレーニングを改善する2つの方法を備えたデータ生成フレームワークを提案します。最初のアプローチは、メインモデルのハードサンプルを生成します。ジェネレーターはメインモデルと共同で学習され、メインモデルのトレーニング状態に基づいてハードサンプルを動的にカスタマイズします。さらに、類似しているが異なるサンプルを正のペアとして生成するために、データジェネレータのペアが提案されています。共同学習では、正のペアの硬さは、それらの類似性を下げることによって徐々に増加します。複数のデータセットに関する実験結果は、CL に適用された提案されたデータ生成方法の優れた精度とデータ効率を示しています。たとえば、ImageNet-100、CIFAR-100、および CIFAR-10 では、線形分類の精度がそれぞれ約 4.0%、3.5%、および 2.6% 向上しています。さらに、線形分類では最大 2 倍のデータ効率、転移学習では最大 5 倍のデータ効率が達成されます。

Contrastive learning (CL), a self-supervised learning approach, can effectively learn visual representations from unlabeled data. Given the CL training data, generative models can be trained to generate synthetic data to supplement the real data. Using both synthetic and real data for CL training has the potential to improve the quality of learned representations. However, synthetic data usually has lower quality than real data, and using synthetic data may not improve CL compared with using real data. To tackle this problem, we propose a data generation framework with two methods to improve CL training by joint sample generation and contrastive learning. The first approach generates hard samples for the main model. The generator is jointly learned with the main model to dynamically customize hard samples based on the training state of the main model. Besides, a pair of data generators are proposed to generate similar but distinct samples as positive pairs. In joint learning, the hardness of a positive pair is progressively increased by decreasing their similarity. Experimental results on multiple datasets show superior accuracy and data efficiency of the proposed data generation methods applied to CL. For example, about 4.0%, 3.5%, and 2.6% accuracy improvements for linear classification are observed on ImageNet-100, CIFAR-100, and CIFAR-10, respectively. Besides, up to 2x data efficiency for linear classification and up to 5x data efficiency for transfer learning are achieved.

updated: Tue Nov 22 2022 02:50:17 GMT+0000 (UTC)

published: Mon Feb 14 2022 02:41:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト