Semi-Supervised Few-Shot Classification with Deep Invertible Hybrid Models

Yusuke Ohtsubo; Tetsu Matsukawa; Einoshin Suzuki

ディープインバーシブルハイブリッドモデルによる半教師あり少数ショット分類

この論文では、半教師あり数ショット分類のために潜在空間レベルで識別学習と生成学習を統合するディープインバーシブルハイブリッドモデルを提案します。画像データから新種を分類するためのさまざまなタスクは、半教師あり数ショット分類としてモデル化できます。これは、ラベル付きおよびラベルなしのトレーニング例と、ターゲットクラスの小さなサポートセットを想定しています。クラスごとにいくつかのサポート例を使用してターゲットクラスを予測すると、ラベルのないトレーニング例のクラスラベルを繰り返し推定してトレーニングクラスの分類子を学習するセルフトレーニングなど、既存の半教師あり分類方法の学習タスクが困難になります。ラベルのないトレーニング例を効果的に活用するために、識別関数と生成学習を統合し、他の一般的な統合学習アプローチである以前のパラメーター結合よりもディープニューラルネットワークに適した複合尤度を目的関数として採用します。提案されたモデルでは、識別モデルと生成モデルはそれぞれ、さまざまな種類の数ショット学習で優れたパフォーマンスを示したプロトタイプネットワークと、他の3つの主要な方法とは異なり正確な周辺尤度を返す深い可逆モデルであるNormalizingFlowです。、VAE、GAN、および自己回帰モデル。私たちの主な独創性は、潜在空間レベルでのこれらのコンポーネントの統合にあります。これは、過剰適合を防ぐのに効果的です。 mini-ImageNetおよびVGG-Faceデータセットを使用した実験は、私たちの方法がセルフトレーニングベースのプロトタイプネットワークよりも優れていることを示しています。

In this paper, we propose a deep invertible hybrid model which integrates discriminative and generative learning at a latent space level for semi-supervised few-shot classification. Various tasks for classifying new species from image data can be modeled as a semi-supervised few-shot classification, which assumes a labeled and unlabeled training examples and a small support set of the target classes. Predicting target classes with a few support examples per class makes the learning task difficult for existing semi-supervised classification methods, including selftraining, which iteratively estimates class labels of unlabeled training examples to learn a classifier for the training classes. To exploit unlabeled training examples effectively, we adopt as the objective function the composite likelihood, which integrates discriminative and generative learning and suits better with deep neural networks than the parameter coupling prior, the other popular integrated learning approach. In our proposed model, the discriminative and generative models are respectively Prototypical Networks, which have shown excellent performance in various kinds of few-shot learning, and Normalizing Flow a deep invertible model which returns the exact marginal likelihood unlike the other three major methods, i.e., VAE, GAN, and autoregressive model. Our main originality lies in our integration of these components at a latent space level, which is effective in preventing overfitting. Experiments using mini-ImageNet and VGG-Face datasets show that our method outperforms selftraining based Prototypical Networks.

updated: Sat May 22 2021 05:55:16 GMT+0000 (UTC)

published: Sat May 22 2021 05:55:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト