Self-Distilled Self-Supervised Representation Learning

Jiho Jang; Seonhoon Kim; Kiyoon Yoo; Chaerin Kong; Jangho Kim; Nojun Kwak

自己蒸留自己教師あり表現学習

自己教師あり学習の最先端のフレームワークは、トランスフォーマーベースのモデルを十分に活用することで、従来の CNN モデルと比較してパフォーマンスが向上することを最近示しました。画像の 2 つのビューの相互情報を最大化するために努力している既存の作品は、最終的な表現に対照的な損失を適用します。教師あり体制での自己蒸留によって動機付けられ、中間表現が対照的な損失を介して最終層から学習できるようにすることで、これをさらに活用します。自己蒸留により、中間層はインスタンスの識別により適しているため、早期に終了したサブネットワークのパフォーマンスは、完全なネットワークのパフォーマンスからそれほど低下しません。これにより、最終レイヤーでも口実タスクが容易になり、表現が向上します。私たちの方法、Self-Distilled Self-Supervised Learning (SDSSL) は、さまざまなタスクやデータセットで ViT を使用して、競合するベースライン (SimCLR、BYOL、MoCo v3) よりも優れています。線形評価と k-NN プロトコルでは、SDSSL は最終層だけでなく、ほとんどの下位層でも優れたパフォーマンスをもたらします。さらに、定性的および定量的分析により、表現がトランス層に沿ってより効果的に形成される方法が示されます。コードは https://github.com/hagiss/SDSSL で入手できます。

State-of-the-art frameworks in self-supervised learning have recently shown that fully utilizing transformer-based models can lead to performance boost compared to conventional CNN models. Striving to maximize the mutual information of two views of an image, existing works apply a contrastive loss to the final representations. Motivated by self-distillation in the supervised regime, we further exploit this by allowing the intermediate representations to learn from the final layer via the contrastive loss. Through self-distillation, the intermediate layers are better suited for instance discrimination, making the performance of an early-exited sub-network not much degraded from that of the full network. This renders the pretext task easier also for the final layer, leading to better representations. Our method, Self-Distilled Self-Supervised Learning (SDSSL), outperforms competitive baselines (SimCLR, BYOL and MoCo v3) using ViT on various tasks and datasets. In the linear evaluation and k-NN protocol, SDSSL not only leads to superior performance in the final layers, but also in most of the lower layers. Furthermore, qualitative and quantitative analyses show how representations are formed more effectively along the transformer layers. Code is available at https://github.com/hagiss/SDSSL.

updated: Thu Nov 24 2022 01:41:31 GMT+0000 (UTC)

published: Thu Nov 25 2021 07:52:36 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト