MetAug: Contrastive Learning via Meta Feature Augmentation

Jiangmeng Li; Wenwen Qiang; Changwen Zheng; Bing Su; Hui Xiong

MetAug：メタ機能拡張による対照学習

対照学習にとって重要なことは何ですか？対照学習は、有益な機能、つまり「ハード」（ポジティブまたはネガティブ）機能に大きく依存していると私たちは主張します。初期の作品には、複雑なデータ拡張と大きなバッチサイズまたはメモリバンクを適用することによるより有益な機能が含まれ、最近の作品は、有益な機能を探索するための精巧なサンプリングアプローチを設計しています。このような機能を検討する上での主な課題は、ランダムなデータ拡張を適用してソースマルチビューデータを生成することです。これにより、拡張データに有用な情報を常に追加することが不可能になります。その結果、そのような拡張データから学習された機能の有益性は制限されます。それに応じて、潜在空間の特徴を直接増強し、それによって大量の入力データなしで識別表現を学習することを提案します。メタ学習手法を実行して、エンコーダーのパフォーマンスを考慮してネットワークパラメーターを更新する拡張ジェネレーターを構築します。ただし、入力データが不十分な場合、エンコーダーが折りたたまれた機能を学習し、拡張ジェネレーターが誤動作する可能性があります。エンコーダーが縮退マッピングを学習することを回避するために、新しいマージン注入正則化が目的関数にさらに追加されます。 1つの勾配バックプロパゲーションステップですべての機能を対比するために、従来のコントラスト損失の代わりに、提案された最適化駆動型の統一されたコントラスト損失を採用します。経験的に、私たちの方法は、いくつかのベンチマークデータセットで最先端の結果を達成します。

What matters for contrastive learning? We argue that contrastive learning heavily relies on informative features, or "hard" (positive or negative) features. Early works include more informative features by applying complex data augmentations and large batch size or memory bank, and recent works design elaborate sampling approaches to explore informative features. The key challenge toward exploring such features is that the source multi-view data is generated by applying random data augmentations, making it infeasible to always add useful information in the augmented data. Consequently, the informativeness of features learned from such augmented data is limited. In response, we propose to directly augment the features in latent space, thereby learning discriminative representations without a large amount of input data. We perform a meta learning technique to build the augmentation generator that updates its network parameters by considering the performance of the encoder. However, insufficient input data may lead the encoder to learn collapsed features and therefore malfunction the augmentation generator. A new margin-injected regularization is further added in the objective function to avoid the encoder learning a degenerate mapping. To contrast all features in one gradient back-propagation step, we adopt the proposed optimization-driven unified contrastive loss instead of the conventional contrastive loss. Empirically, our method achieves state-of-the-art results on several benchmark datasets.

updated: Sat Nov 12 2022 19:42:01 GMT+0000 (UTC)

published: Thu Mar 10 2022 02:35:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト