Self-Supervised Models are Continual Learners

Enrico Fini; Victor G. Turrisi da Costa; Xavier Alameda-Pineda; Elisa Ricci; Karteek Alahari; Julien Mairal

Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale. However, their efficacy is catastrophically reduced in a Continual Learning (CL) scenario where data is presented to the model sequentially. In this paper, we show that self-supervised loss functions can be seamlessly converted into distillation mechanisms for CL by adding a predictor network that maps the current state of the representations to their past state. This enables us to devise a framework for Continual self-supervised visual representation Learning that (i) significantly improves the quality of the learned representations, (ii) is compatible with several state-of-the-art self-supervised objectives, and (iii) needs little to no hyperparameter tuning. We demonstrate the effectiveness of our approach empirically by training six popular self-supervised models in various CL settings.
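The distillation idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes a negative-cosine-similarity objective (one common self-supervised loss), a purely linear predictor, and an unweighted sum of the two loss terms. All function names here are hypothetical.

```python
import math


def linear(weights, x):
    """Apply a linear map, given as a list of rows, to the vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def negative_cosine(p, z):
    """Negative cosine similarity: -1.0 when p and z point the same way."""
    dot = sum(a * b for a, b in zip(p, z))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in z))
    return -dot / norm


def distillation_loss(predictor_weights, z_current, z_past):
    """Map the current representation toward its past state and compare.

    z_past is assumed to come from a frozen copy of the encoder saved
    after the previous task, so it is treated as a constant target.
    The linear predictor is an assumption made for brevity.
    """
    predicted_past = linear(predictor_weights, z_current)
    return negative_cosine(predicted_past, z_past)


def total_loss(ssl_loss, distill_loss):
    """Combine the losses as a plain sum (assumed: no weighting factor)."""
    return ssl_loss + distill_loss


# Toy usage: the representation has rotated by 90 degrees between tasks.
z_past = [0.0, 1.0]
z_current = [1.0, 0.0]
identity = [[1.0, 0.0], [0.0, 1.0]]
rotation = [[0.0, -1.0], [1.0, 0.0]]

loss_without_mapping = distillation_loss(identity, z_current, z_past)
loss_with_mapping = distillation_loss(rotation, z_current, z_past)
```

In this toy case the identity predictor leaves the current and past representations orthogonal, while the rotation predictor aligns them exactly. This is the role the abstract assigns to the predictor: the encoder stays free to change its representation as long as the past state remains recoverable from the current one.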

updated: Wed Dec 08 2021 10:39:13 GMT+0000 (UTC)

published: Wed Dec 08 2021 10:39:13 GMT+0000 (UTC)

arXiv
