Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning

Haoxiang Wang; Yite Wang; Ruoyu Sun; Bo Li

MAMLのグローバルコンバージェンスと理論に着想を得たニューラルアーキテクチャによる少数のショット学習の検索

モデルにとらわれないメタ学習（MAML）とその変形は、数ショット学習の一般的なアプローチになっています。ただし、ディープニューラルネット（DNN）の非凸性と、MAMLの2レベルの定式化により、DNNを使用したMAMLの理論的特性はほとんど不明なままです。この論文では、最初に、過剰にパラメータ化されたDNNを持つMAMLが、線形速度でグローバル最適に収束することが保証されていることを証明します。収束分析では、パラメーターが過剰なDNNを使用するMAMLは、メタニューラルタンジェントカーネル（MetaNTK）と呼ばれる新しいクラスのカーネルを使用したカーネル回帰と同等であることが示されています。次に、MetaNTKを使用してアーキテクチャをランク付けおよび選択する、数ショット学習のための新しいトレーニングフリーのニューラルアーキテクチャ検索（NAS）メソッドであるMetaNTK-NASを提案します。経験的に、2つの人気のある数ショット学習ベンチマークであるminiImageNetとtieredImageNetで、MetaNTK-NASを以前のNASメソッドと比較します。 MetaNTK-NASのパフォーマンスは、100倍以上のスピードアップを楽しみながら、数ショットの学習用に設計された最先端のNAS方式と同等またはそれ以上であることを示しています。 MetaNTK-NASの効率は、多くの実際のタスクでより実用的になると信じています。

Model-agnostic meta-learning (MAML) and its variants have become popular approaches for few-shot learning. However, due to the non-convexity of deep neural nets (DNNs) and the bi-level formulation of MAML, the theoretical properties of MAML with DNNs remain largely unknown. In this paper, we first prove that MAML with over-parameterized DNNs is guaranteed to converge to global optima at a linear rate. Our convergence analysis indicates that MAML with over-parameterized DNNs is equivalent to kernel regression with a novel class of kernels, which we name as Meta Neural Tangent Kernels (MetaNTK). Then, we propose MetaNTK-NAS, a new training-free neural architecture search (NAS) method for few-shot learning that uses MetaNTK to rank and select architectures. Empirically, we compare our MetaNTK-NAS with previous NAS methods on two popular few-shot learning benchmarks, miniImageNet, and tieredImageNet. We show that the performance of MetaNTK-NAS is comparable or better than the state-of-the-art NAS method designed for few-shot learning while enjoying more than 100x speedup. We believe the efficiency of MetaNTK-NAS makes itself more practical for many real-world tasks.

updated: Thu Mar 17 2022 07:31:19 GMT+0000 (UTC)

published: Thu Mar 17 2022 07:31:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト