MetaNODE: Prototype Optimization as a Neural ODE for Few-Shot Learning

Baoquan Zhang; Xutao Li; Shanshan Feng; Yunming Ye; Rui Ye

Few-Shot Learning (FSL) is the challenging task of recognizing novel classes from only a few examples. Pre-training based methods tackle the problem effectively by pre-training a feature extractor and then predicting novel classes via a cosine nearest neighbor classifier with mean-based prototypes. Nevertheless, because data are scarce, the mean-based prototypes are usually biased. In this paper, we attempt to diminish the prototype bias by treating it as a prototype optimization problem. To this end, we propose a novel meta-learning based prototype optimization framework that rectifies prototypes by introducing a meta-optimizer to optimize them. Although existing meta-optimizers can also be adapted to our framework, they all overlook a crucial gradient bias issue: the mean-based gradient estimate is also biased on sparse data. To address this issue, we regard the gradient and its flow as meta-knowledge and propose a novel Neural Ordinary Differential Equation (ODE)-based meta-optimizer, called MetaNODE, to polish prototypes. In this meta-optimizer, we first take the mean-based prototypes as initial prototypes, and then model the process of prototype optimization as continuous-time dynamics specified by a Neural ODE. A gradient flow inference network is carefully designed to learn to estimate the continuous gradient flow for the prototype dynamics. Finally, the optimal prototypes are obtained by solving the Neural ODE. Extensive experiments on miniImagenet, tieredImagenet, and CUB-200-2011 show the effectiveness of our method.
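The pipeline described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation. The abstract's learned gradient flow inference network is replaced here by a hand-written stand-in (the negative gradient of the support-set cross-entropy), and the Neural ODE solver is replaced by fixed-step forward Euler. All function names, the temperature `scale`, the step count, and the synthetic features are assumptions made for illustration only.

```python
import numpy as np

def l2_normalize(x, eps=1e-8):
    """Scale each row to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def mean_prototypes(features, labels, n_classes):
    """Mean-based prototypes: the per-class average of support features."""
    return np.stack([features[labels == c].mean(axis=0) for c in range(n_classes)])

def cosine_logits(features, prototypes, scale=10.0):
    """Cosine similarity between features and prototypes, times an assumed temperature."""
    return scale * l2_normalize(features) @ l2_normalize(prototypes).T

def support_loss(prototypes, features, labels, scale=10.0):
    """Cross-entropy of the cosine classifier on the support set."""
    probs = softmax(cosine_logits(features, prototypes, scale))
    return -np.log(probs[np.arange(len(labels)), labels] + 1e-12).mean()

def gradient_flow(prototypes, features, labels, scale=10.0):
    """Stand-in for a learned flow: dp/dt = -d(support_loss)/dp (assumption)."""
    f_hat = l2_normalize(features)
    p_norm = np.linalg.norm(prototypes, axis=1, keepdims=True) + 1e-8
    p_hat = prototypes / p_norm
    probs = softmax(scale * f_hat @ p_hat.T)
    onehot = np.eye(prototypes.shape[0])[labels]
    # Gradient with respect to the normalized prototypes, shape (C, D).
    g_hat = scale * (probs - onehot).T @ f_hat / len(labels)
    # Chain rule through the normalization p -> p / ||p||.
    g = (g_hat - (g_hat * p_hat).sum(axis=1, keepdims=True) * p_hat) / p_norm
    return -g

def solve_euler(p0, features, labels, t_end=1.0, n_steps=20):
    """Integrate the prototype dynamics from t=0 to t_end with forward Euler."""
    p = p0.copy()
    dt = t_end / n_steps
    for _ in range(n_steps):
        p = p + dt * gradient_flow(p, features, labels)
    return p

# Synthetic 5-way 5-shot episode standing in for pre-trained features.
rng = np.random.default_rng(0)
n_classes, n_shot, dim = 5, 5, 16
centers = rng.normal(size=(n_classes, dim))
labels = np.repeat(np.arange(n_classes), n_shot)
features = centers[labels] + rng.normal(size=(len(labels), dim))

p_init = mean_prototypes(features, labels, n_classes)   # initial prototypes
p_final = solve_euler(p_init, features, labels)         # state at the end of the flow
predictions = cosine_logits(features, p_final).argmax(axis=1)
print("support loss before:", support_loss(p_init, features, labels))
print("support loss after: ", support_loss(p_final, features, labels))
```

In this sketch the flow depends only on the support set, so it inherits the gradient bias the abstract points out. The paper's proposal is to learn the flow as meta-knowledge instead, which this example does not attempt.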

updated: Mon Dec 06 2021 08:43:20 GMT+0000 (UTC)

published: Fri Mar 26 2021 09:16:46 GMT+0000 (UTC)

arXiv

