DICS-Net: Dictionary-guided Implicit-Component-Supervision Network for Few-Shot Classification

Shuai Shao; Lei Xing; Weifeng Liu; Yanjiang Wang; Baodi Liu

DICS-Net: 少数ショット分類のための辞書に基づく暗黙的コンポーネント監視ネットワーク

少数ショット分類 (FSC) タスクは、最近注目されている研究トピックです。これは、クロスカテゴリベースでラベル付けされたデータが不十分な場合の分類問題に対処することを目的としています。通常、研究者は基本データを使用して特徴抽出器を事前にトレーニングし、それを使用して新しいデータの特徴を抽出して認識します。特に、新しいセットにはいくつかの注釈付きサンプルしかなく、基本セットからの重複しないカテゴリがあるため、事前トレーニング済みの特徴抽出器は新しいデータに完璧に適応できません。この問題は、Feature-Extractor-Maladaptive (FEM) 問題と呼ばれています。この問題の根本的な原因から始めて、この論文では、FSC のパフォーマンスを改善するための新しいスキーム、辞書ガイド型暗黙的コンポーネント監視ネットワーク (DICS-Net) を提示します。ベースセットとノベルセットのカテゴリは異なりますが、サンプルのコンポーネントの構成は類似していると考えられます。たとえば、猫と犬の両方に脚と頭のコンポーネントが含まれています。実際、そのようなエンティティコンポーネントはクラス内で安定しています。それらは、優れたクロスカテゴリの汎用性と新しいカテゴリの一般化を備えています。ただし、多くの現実世界のシナリオでは、さまざまなカテゴリ (猫と飛行機など) の共通情報を見つけるのは容易ではないため、この仮定に基づくモデル化の可能性が妨げられます。したがって、最初に辞書ベースの Implicit-Component Generator (DICG) を設計して、異なるセットの共通情報をマイニングします。次に、暗黙的なコンポーネントベースの補助タスクを構築して、特徴抽出器の適応性を向上させます。 3 つのベンチマークデータセット (mini-ImageNet、tiered-ImageNet、および FC100) で実験を行います。最新技術と比較して 0.9% ～ 10.1% の改善が、当社の DICS-Net の効率を評価しています。

The few-shot classification (FSC) task has recently been a hot research topic. It aims to address the classification problem with insufficient labeled data on a cross-category basis. Typically, researchers pre-train a feature extractor with base data, then use it to extract the features of novel data and recognize them. Notably, the novel set only has a few annotated samples and has non-overlapped categories from the base set, which leads to that the pre-trained feature extractor can not adapt to the novel data flawlessly. We dub this problem as Feature-Extractor-Maladaptive (FEM) problem. Starting from the root cause of this problem, this paper presents a new scheme, Dictionary-guided Implicit-Component-Supervision Network (DICS-Net), to improve the performance of FSC. We believe that although the categories of base and novel sets are different, the composition of the sample's components is similar. For example, both cats and dogs contain leg and head components. Actually, such entity components are intra-class stable. They have fine cross-category versatility and new category generalization. However, in many real-world scenarios, common information of different categories (such as cats and airplanes) is not easy to find, which hinders the possibility of modeling based on this assumption. Therefore, we first design a Dictionary-based Implicit-Component Generator (DICG) to mine common information of different sets; then construct an implicit-component-based auxiliary task to improve the adaptability of the feature extractor. We conduct experiments on three benchmark datasets (mini-ImageNet, tiered-ImageNet, and FC100). The improvements of 0.9%-10.1% compared with state-of-the-arts have evaluated the efficiency of our DICS-Net.

updated: Sat Oct 15 2022 05:12:13 GMT+0000 (UTC)

published: Tue Mar 15 2022 09:13:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト