Self-Labeling Refinement for Robust Representation Learning with Bootstrap Your Own Latent

Siddhant Garg; Dhruval Jain

独自の潜在性をブートストラップすることによるロバストな表現学習のための自己ラベリングの改良

この作業では、2つの主要な目標に向けて取り組んできました。まず、Bootstrap Your Own Latent（BYOL）と呼ばれる非対照表現学習フレームワークにおけるバッチ正規化（BN）レイヤーの重要性を調査しました。 BYOLでの表現学習にはBN層は必要ないと結論付けるために、いくつかの実験を行いました。さらに、BYOLは画像の正のペアからのみ学習しますが、同じ入力バッチ内の他の意味的に類似した画像を無視します。 2番目の目標として、2つの新しい損失関数を導入して、同じ入力バッチの画像で意味的に類似したペアを決定し、それらの表現間の距離を縮めます。これらの損失関数は、クロスコサイン類似性損失（CCSL）とクロスシグモイド類似性損失（CSSL）です。提案された損失関数を使用すると、STL10データセットでCCSL損失（76.87％）を使用してBYOLフレームワークをトレーニングすることにより、Vanilla BYOL（71.04％）のパフォーマンスを超えることができます。 CSSL損失を使用してトレーニングされたBYOLは、VanillaBYOLと同等のパフォーマンスを発揮します。

In this work, we have worked towards two major goals. Firstly, we have investigated the importance of Batch Normalisation (BN) layers in a non-contrastive representation learning framework called Bootstrap Your Own Latent (BYOL). We conducted several experiments to conclude that BN layers are not necessary for representation learning in BYOL. Moreover, BYOL only learns from the positive pairs of images but ignores other semantically similar images in the same input batch. For the second goal, we have introduced two new loss functions to determine the semantically similar pairs in the same input batch of images and reduce the distance between their representations. These loss functions are Cross-Cosine Similarity Loss (CCSL) and Cross-Sigmoid Similarity Loss (CSSL). Using the proposed loss functions, we are able to surpass the performance of Vanilla BYOL (71.04%) by training the BYOL framework using CCSL loss (76.87%) on the STL10 dataset. BYOL trained using CSSL loss performs comparably with Vanilla BYOL.

updated: Sat Apr 09 2022 20:30:20 GMT+0000 (UTC)

published: Sat Apr 09 2022 20:30:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト