Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Chelsea Finn; Pieter Abbeel; Sergey Levine

ディープネットワークの高速適応のためのモデルに依存しないメタ学習

勾配降下で訓練されたモデルと互換性があり、分類、回帰、強化学習などのさまざまな学習問題に適用できるという意味で、モデルに依存しないメタ学習のアルゴリズムを提案します。メタ学習の目標は、さまざまな学習タスクでモデルをトレーニングし、少数のトレーニングサンプルのみを使用して新しい学習タスクを解決できるようにすることです。このアプローチでは、モデルのパラメーターは明示的にトレーニングされ、新しいタスクからの少量のトレーニングデータを使用した少数の勾配ステップにより、そのタスクで良好な一般化パフォーマンスが生成されます。実際、このメソッドはモデルを調整して簡単に微調整できるようにします。このアプローチにより、2つの少数ショット画像分類ベンチマークで最先端のパフォーマンスが得られ、少数ショット回帰で良好な結果が得られ、ニューラルネットワークポリシーによるポリシー勾配強化学習の微調整が加速されることが実証されます。

We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.

updated: Tue Jul 18 2017 16:45:29 GMT+0000 (UTC)

published: Thu Mar 09 2017 18:58:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト