VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning

Adrien Bardes; Jean Ponce; Yann LeCun

Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image. A trivial solution is obtained when the encoder outputs constant vectors. This collapse problem is often avoided through implicit biases in the learning architecture, which often lack a clear justification or interpretation. In this paper, we introduce VICReg (Variance-Invariance-Covariance Regularization), a method that explicitly avoids the collapse problem with a simple regularization term on the variance of the embeddings along each dimension individually. VICReg combines the variance term with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks. In addition, we show that incorporating our new variance term into other methods helps stabilize the training and leads to performance improvements.
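The three terms named in the abstract can be illustrated with a small NumPy sketch. The abstract itself gives no formulas, so everything specific here is an assumption made for illustration: the hinge on the per-dimension standard deviation with target `gamma`, the squared off-diagonal covariance penalty divided by the embedding dimension, the term weights (25, 25, 1), and the function name `vicreg_loss`. It is not the authors' implementation.

```python
import numpy as np

def vicreg_loss(z_a, z_b, sim_w=25.0, var_w=25.0, cov_w=1.0, gamma=1.0, eps=1e-4):
    """Illustrative VICReg-style loss for two batches of embeddings, shape (n, d).

    All weights and the exact form of each term are assumptions for this sketch.
    Returns (total, invariance, variance, covariance).
    """
    n, d = z_a.shape

    # Invariance: mean squared distance between embeddings of the two views.
    invariance = np.mean(np.sum((z_a - z_b) ** 2, axis=1))

    def variance_term(z):
        # Hinge on the standard deviation of each dimension across the batch;
        # it is positive whenever a dimension's spread falls below gamma.
        std = np.sqrt(z.var(axis=0, ddof=1) + eps)
        return np.mean(np.maximum(0.0, gamma - std))

    def covariance_term(z):
        # Penalize squared off-diagonal entries of the covariance matrix,
        # which pushes different embedding dimensions to be decorrelated.
        zc = z - z.mean(axis=0)
        cov = zc.T @ zc / (n - 1)
        off_diag = cov - np.diag(np.diag(cov))
        return np.sum(off_diag ** 2) / d

    variance = variance_term(z_a) + variance_term(z_b)
    covariance = covariance_term(z_a) + covariance_term(z_b)
    total = sim_w * invariance + var_w * variance + cov_w * covariance
    return total, invariance, variance, covariance

# Usage: a collapsed encoder (constant output) versus spread-out embeddings.
rng = np.random.default_rng(0)
z = rng.standard_normal((256, 8))
spread = vicreg_loss(z, z + 0.01 * rng.standard_normal((256, 8)))
collapsed = vicreg_loss(np.ones((256, 8)), np.ones((256, 8)))
print("spread:", spread[0], "collapsed:", collapsed[0])
```

In this sketch the collapsed case has zero invariance and covariance cost, so the variance term alone is what makes the constant solution expensive, which is the role the abstract assigns to it.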

updated: Fri Oct 15 2021 11:27:50 GMT+0000 (UTC)

published: Tue May 11 2021 09:53:21 GMT+0000 (UTC)

arXiv
