MetaNODE: Prototype Optimization as a Neural ODE for Few-Shot Learning

Baoquan Zhang; Xutao Li; Yunming Ye; Shanshan Feng; Rui Ye

MetaNODE：少数ショット学習のためのニューラルODEとしてのプロトタイプ最適化

少数ショット学習（FSL）は難しい作業です。つまり、いくつかの例で新しいクラスを認識する方法は？事前トレーニングベースの方法は、特徴抽出器を事前トレーニングすることで問題に効果的に取り組み、平均ベースのプロトタイプを使用して最近傍分類器を介して新しいクラスを予測します。それにもかかわらず、データが不足しているため、平均ベースのプロトタイプは通常バイアスがかかっています。この論文では、それをプロトタイプ最適化問題と見なすことにより、バイアスを減らします。既存のメタオプティマイザーも最適化に適用できますが、それらはすべて、重大な勾配バイアスの問題を見落としています。つまり、平均ベースの勾配推定も、不足しているデータにバイアスがかかっています。したがって、勾配自体をメタ知識と見なし、MetaNODEと呼ばれる新しいプロトタイプ最適化ベースのメタ学習フレームワークを提案します。具体的には、最初に平均ベースのプロトタイプを初期プロトタイプと見なし、次にプロトタイプ最適化のプロセスを神経常微分方程式（Neural ODE）で指定された連続時間ダイナミクスとしてモデル化します。勾配フロー推論ネットワークは、プロトタイプダイナミクスの連続勾配を推定することを学習するように注意深く設計されています。最後に、ルンゲクッタ法を使用してニューラルODEを解くことにより、最適なプロトタイプを取得できます。広範な実験により、提案された方法が以前の最先端の方法よりも優れた性能を得ることが実証されています。私たちのコードは、承認されると公開されます。

Few-Shot Learning (FSL) is a challenging task, i.e., how to recognize novel classes with few examples? Pre-training based methods effectively tackle the problem by pre-training a feature extractor and then predict novel classes via a nearest neighbor classifier with mean-based prototypes. Nevertheless, due to the data scarcity, the mean-based prototypes are usually biased. In this paper, we diminish the bias by regarding it as a prototype optimization problem. Although the existing meta-optimizers can also be applied for the optimization, they all overlook a crucial gradient bias issue, i.e., the mean-based gradient estimation is also biased on scarce data. Consequently, we regard the gradient itself as meta-knowledge and then propose a novel prototype optimization-based meta-learning framework, called MetaNODE. Specifically, we first regard the mean-based prototypes as initial prototypes, and then model the process of prototype optimization as continuous-time dynamics specified by a Neural Ordinary Differential Equation (Neural ODE). A gradient flow inference network is carefully designed to learn to estimate the continuous gradients for prototype dynamics. Finally, the optimal prototypes can be obtained by solving the Neural ODE using the Runge-Kutta method. Extensive experiments demonstrate that our proposed method obtains superior performance over the previous state-of-the-art methods. Our code will be publicly available upon acceptance.

updated: Fri Mar 26 2021 09:16:46 GMT+0000 (UTC)

published: Fri Mar 26 2021 09:16:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト