Unsupervised Visual Representation Learning via Mutual Information Regularized Assignment

Dong Hoon Lee; Sungik Choi; Hyunwoo Kim; Sae-Young Chung

相互情報正規化代入による教師なし視覚表現学習

この論文では、情報の最大化に触発された教師なし表現学習のための擬似ラベル付けアルゴリズムである相互情報正規化割り当て (MIRA) を提案します。与えられたモデル確率に近く、ラベルとデータ間の相互情報量を最大化する疑似ラベルを見つけるための最適化問題として、オンライン疑似ラベル付けを定式化します。不動点反復法を導出し、最適解への収束を証明します。ベースラインとは対照的に、疑似ラベル予測と組み合わせた MIRA は、追加のトレーニング手法や、サンプリング戦略、等分割制約などの人為的な制約を組み込むことなく、シンプルかつ効果的なクラスタリングベースの表現学習を可能にします。比較的小さなトレーニングエポックで、MIRA によって学習された表現線形/k-NN 評価や転移学習など、さまざまなダウンストリームタスクで最先端のパフォーマンスを実現します。特に、わずか 400 エポックで、ResNet-50 アーキテクチャを使用して ImageNet データセットに適用された方法は、75.6% の線形評価精度を達成します。

This paper proposes Mutual Information Regularized Assignment (MIRA), a pseudo-labeling algorithm for unsupervised representation learning inspired by information maximization. We formulate online pseudo-labeling as an optimization problem to find pseudo-labels that maximize the mutual information between the label and data while being close to a given model probability. We derive a fixed-point iteration method and prove its convergence to the optimal solution. In contrast to baselines, MIRA combined with pseudo-label prediction enables a simple yet effective clustering-based representation learning without incorporating extra training techniques or artificial constraints such as sampling strategy, equipartition constraints, etc. With relatively small training epochs, representation learned by MIRA achieves state-of-the-art performance on various downstream tasks, including the linear/k-NN evaluation and transfer learning. Especially, with only 400 epochs, our method applied to ImageNet dataset with ResNet-50 architecture achieves 75.6% linear evaluation accuracy.

updated: Fri Nov 04 2022 06:49:42 GMT+0000 (UTC)

published: Fri Nov 04 2022 06:49:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト