Evaluating Representations with Readout Model Switching

Yazhe Li; Jorg Bornschein; Marcus Hutter

読み出しモデルの切り替えによる表現の評価

ディープラーニングの成功の多くは、優れた表現の学習に基づいていますが、その品質を評価する厳密な方法はありません。この論文では、表現の評価をモデル選択問題として扱い、評価メトリックを考案するために最小記述長 (MDL) 原則を使用することを提案します。読み出しモデルの容量を制限するという確立された慣行に反して、読み出しモデルのハイブリッド離散および連続値モデル空間を設計し、切り替え戦略を採用してそれらの予測を組み合わせます。 MDL スコアでは、モデルの複雑さとデータ効率が考慮されます。その結果、特定のタスクと表現に最も適したモデルが選択され、比較のための統一された尺度になります。提案されたメトリクスは、オンラインメソッドで効率的に計算できます。さまざまなダウンストリームタスクで、さまざまなアーキテクチャ (ResNet および ViT) と目的関数 (教師ありおよび自己教師あり) の事前トレーニング済みビジョンエンコーダーの結果を提示します。私たちの方法を精度ベースのアプローチと比較し、複数の読み出しモデルが使用されている場合、後者が一貫していないことを示します。最後に、モデルのスケーリング、推奨される読み出しモデル、データ効率など、評価によって明らかになった重要な特性について説明します。

Although much of the success of Deep Learning builds on learning good representations, a rigorous method to evaluate their quality is lacking. In this paper, we treat the evaluation of representations as a model selection problem and propose to use the Minimum Description Length (MDL) principle to devise an evaluation metric. Contrary to the established practice of limiting the capacity of the readout model, we design a hybrid discrete and continuous-valued model space for the readout models and employ a switching strategy to combine their predictions. The MDL score takes model complexity, as well as data efficiency into account. As a result, the most appropriate model for the specific task and representation will be chosen, making it a unified measure for comparison. The proposed metric can be efficiently computed with an online method and we present results for pre-trained vision encoders of various architectures (ResNet and ViT) and objective functions (supervised and self-supervised) on a range of downstream tasks. We compare our methods with accuracy-based approaches and show that the latter are inconsistent when multiple readout models are used. Finally, we discuss important properties revealed by our evaluations such as model scaling, preferred readout model, and data efficiency.

updated: Sun Feb 19 2023 14:08:01 GMT+0000 (UTC)

published: Sun Feb 19 2023 14:08:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト