A Closer Look at Prototype Classifier for Few-shot Image Classification

Mingcheng Hou; Issei Sato

数ショットの画像分類のためのプロトタイプ分類器の詳細

プロトタイプネットワークは、メタ学習に基づくプロトタイプ分類器であり、メタテスト中にハイパーパラメータを調整せずにクラス固有のプロトタイプを構築することで、目に見えない例を分類するため、少数ショット学習に広く使用されています。興味深いことに、最近の研究は多くの注目を集めており、メタ学習アルゴリズムを使用しない微調整を備えた線形分類器が、典型的なネットワークと同等に機能することを示しています。ただし、モデルを新しい環境に適応させる場合、微調整には追加のハイパーパラメーターが必要です。さらに、数ショット学習の目的は、モデルを新しい環境にすばやく適応できるようにすることですが、新しいクラスが出現するたびに微調整を適用する必要があるため、迅速な適応が困難です。このホワイトペーパーでは、プロトタイプ分類器が微調整やメタ学習なしでどのように機能するかを分析します。標準の事前トレーニング済みモデルを使用して抽出された特徴ベクトルを直接使用してメタテストでプロトタイプ分類器を構築することは、事前トレーニング済みモデルの微調整と特徴ベクトルを使用したプロトタイプネットワークおよび線形分類器と同様に機能しないことが実験的にわかりました。したがって、プロトタイプネットワークにバインドされた新しい一般化を導き出し、特徴ベクトルのノルムの分散に焦点を当てることでパフォーマンスを向上できることを示します。ノルムの分散を最小化するためのいくつかの正規化方法を実験的に調査し、微調整やメタ学習なしでL2正規化と埋め込み空間変換を使用することで同じパフォーマンスが得られることを発見しました。

The prototypical network is a prototype classifier based on meta-learning and is widely used for few-shot learning because it classifies unseen examples by constructing class-specific prototypes without adjusting hyper-parameters during meta-testing. Interestingly, recent research has attracted a lot of attention, showing that a linear classifier with fine-tuning, which does not use a meta-learning algorithm, performs comparably with the prototypical network. However, fine-tuning requires additional hyper-parameters when adapting a model to a new environment. In addition, although the purpose of few-shot learning is to enable the model to quickly adapt to a new environment, fine-tuning needs to be applied every time a new class appears, making fast adaptation difficult. In this paper, we analyze how a prototype classifier works equally well without fine-tuning and meta-learning. We experimentally found that directly using the feature vector extracted using standard pre-trained models to construct a prototype classifier in meta-testing does not perform as well as the prototypical network and linear classifiers with fine-tuning and feature vectors of pre-trained models. Thus, we derive a novel generalization bound for the prototypical network and show that focusing on the variance of the norm of a feature vector can improve performance. We experimentally investigated several normalization methods for minimizing the variance of the norm and found that the same performance can be obtained by using the L2 normalization and embedding space transformation without fine-tuning or meta-learning.

updated: Thu Oct 14 2021 01:58:38 GMT+0000 (UTC)

published: Mon Oct 11 2021 08:28:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト