Mean Embeddings with Test-Time Data Augmentation for Ensembling of Representations

Arsenii Ashukha; Andrei Atanov; Dmitry Vetrov

表現のアンサンブルのためのテスト時間データ拡張を伴う平均埋め込み

一連のモデル（アンサンブル）の平均予測は、深層学習モデルの予測パフォーマンスと不確実性の推定を改善するために広く使用されています。同時に、検索、マッチング、レコメンデーションシステムなど、多くの機械学習システムは埋め込みに大きく依存しています。残念ながら、独立してトレーニングされたモデルの機能の不整合のため、埋め込みは、素朴で深いアンサンブルのようなアプローチでは改善できません。この作業では、表現のアンサンブルを調べ、アンサンブル表現のテスト時間拡張（MeTTA）シンプルでありながらパフォーマンスの高いレシピを使用した平均埋め込みを提案します。経験的に、MeTTAは、教師ありモデルと自己教師ありモデルの両方について、ImageNetでの線形評価の品質を大幅に向上させることを示しています。さらにエキサイティングなことに、MeTTA、画像検索、および変換不変モデルの間の接続を描画します。アンサンブルの成功を広めて、より高品質の表現を推論することは、アンサンブルの多くの新しいアプリケーションを開く重要なステップであると私たちは信じています。

Averaging predictions over a set of models -- an ensemble -- is widely used to improve predictive performance and uncertainty estimation of deep learning models. At the same time, many machine learning systems, such as search, matching, and recommendation systems, heavily rely on embeddings. Unfortunately, due to misalignment of features of independently trained models, embeddings, cannot be improved with a naive deep ensemble like approach. In this work, we look at the ensembling of representations and propose mean embeddings with test-time augmentation (MeTTA) simple yet well-performing recipe for ensembling representations. Empirically we demonstrate that MeTTA significantly boosts the quality of linear evaluation on ImageNet for both supervised and self-supervised models. Even more exciting, we draw connections between MeTTA, image retrieval, and transformation invariant models. We believe that spreading the success of ensembles to inference higher-quality representations is the important step that will open many new applications of ensembling.

updated: Wed Jul 14 2021 16:22:21 GMT+0000 (UTC)

published: Tue Jun 15 2021 10:49:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト