Zero-Shot AutoML with Pretrained Models

Ekrem Öztürk; Fabio Ferreira; Hadi S. Jomaa; Lars Schmidt-Thieme; Josif Grabocka; Frank Hutter

Given a new dataset D and a low compute budget, how should we choose a pre-trained model to fine-tune on D, and how should we set the fine-tuning hyperparameters without risking overfitting, particularly if D is small? Here, we extend automated machine learning (AutoML) to make these choices as well as possible. Our domain-independent meta-learning approach learns a zero-shot surrogate model which, at test time, allows us to select the right deep learning (DL) pipeline (including the pre-trained model and fine-tuning hyperparameters) for a new dataset D, given only trivial meta-features describing D, such as the image resolution or the number of classes. To train this zero-shot model, we collect performance data for many DL pipelines on a large collection of datasets and meta-train on this data to minimize a pairwise ranking objective. We evaluate our approach under the strict time limit of the vision track of the ChaLearn AutoDL challenge benchmark, clearly outperforming all challenge contenders.
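The abstract names two ingredients, a pairwise ranking objective and zero-shot selection from meta-features, without giving their exact form. The following is a minimal sketch of how such a setup could look, not the authors' implementation. It assumes a logistic pairwise loss and a linear surrogate; the names `pairwise_ranking_loss`, `linear_surrogate`, and `select_pipeline`, as well as the feature encodings, are hypothetical and chosen only for illustration.

```python
import math
from itertools import combinations


def pairwise_ranking_loss(scores, observed_perf):
    """Mean logistic loss over all pipeline pairs on one dataset.

    Assumption: a logistic pairwise loss; the abstract does not specify the form.
    scores[i] is the surrogate's score for pipeline i,
    observed_perf[i] is that pipeline's measured performance.
    """
    total, n_pairs = 0.0, 0
    for i, j in combinations(range(len(scores)), 2):
        if observed_perf[i] == observed_perf[j]:
            continue  # ties carry no ranking signal
        # Orient the pair so `better` is the pipeline that actually did better.
        better, worse = (i, j) if observed_perf[i] > observed_perf[j] else (j, i)
        margin = scores[better] - scores[worse]
        # Numerically stable log(1 + exp(-margin)).
        total += max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
        n_pairs += 1
    return total / n_pairs if n_pairs else 0.0


def linear_surrogate(weights, meta_features, pipeline_features):
    """Hypothetical surrogate: a dot product over concatenated features."""
    x = list(meta_features) + list(pipeline_features)
    return sum(w * v for w, v in zip(weights, x))


def select_pipeline(weights, meta_features, candidates):
    """Zero-shot selection: score every candidate, return the top-ranked name.

    No pipeline is trained on the new dataset; only its meta-features are used.
    """
    return max(
        candidates,
        key=lambda name: linear_surrogate(weights, meta_features, candidates[name]),
    )
```

Under these assumptions, meta-training would adjust `weights` to reduce `pairwise_ranking_loss` across the collected datasets, and `select_pipeline` would then be called once on the meta-features of a new dataset D.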

updated: Thu Jun 16 2022 22:52:08 GMT+0000 (UTC)

published: Thu Jun 16 2022 22:52:08 GMT+0000 (UTC)

arXiv
