Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel Attribute Synthesis

Yu Hsuan Li; Tzu-Yin Chao; Ching-Chun Huang; Pin-Yu Chen; Wei-Chen Chiu

Most existing algorithms for zero-shot classification rely on attribute-based semantic relations among categories to classify novel categories without observing any of their instances. However, training zero-shot classification models still requires attribute labels for each class (or even each instance) in the training dataset, which is expensive. To this end, we raise a new problem scenario: "Are we able to derive zero-shot learning for novel attribute detectors/classifiers and use them to automatically annotate the dataset for labeling efficiency?" Given only a small set of detectors that are learned to recognize some manually annotated attributes (i.e., the seen attributes), we aim to synthesize detectors for novel attributes in a zero-shot learning manner. Our proposed method, Zero Shot Learning for Attributes (ZSLA), which is the first of its kind to the best of our knowledge, tackles this new research problem by applying set operations to first decompose the seen attributes into their basic attributes and then recombine these basic attributes into novel ones. Extensive experiments verify that the synthesized detectors accurately capture the semantics of the novel attributes and show superior detection and localization performance compared to other baseline approaches. Moreover, using only 32 seen attributes on the Caltech-UCSD Birds-200-2011 dataset, our proposed method is able to synthesize 207 other novel attributes, and various generalized zero-shot classification algorithms trained on the dataset re-annotated by our synthesized attribute detectors achieve performance comparable to those trained with manual ground-truth annotations.
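The decompose-then-recombine idea in the abstract can be illustrated with a purely symbolic toy. The sketch below is not the ZSLA implementation, which operates on learned attribute detectors. It assumes each attribute is a plain set of one color and one part, and the names COLORS, PARTS, SEEN, decompose, and recombine are hypothetical, introduced only for illustration. Set intersection over pairs of seen attributes recovers the shared basic attributes, and set union over color/part pairs yields the combinations that were never seen.

```python
from itertools import combinations, product

# Hypothetical toy vocabulary (an assumption, not taken from the paper).
COLORS = {"red", "blue", "green"}
PARTS = {"wing", "tail", "breast"}

# Six "seen" attributes, each modeled as a set {color, part}.
SEEN = [
    frozenset({"red", "wing"}),
    frozenset({"red", "tail"}),
    frozenset({"blue", "tail"}),
    frozenset({"blue", "breast"}),
    frozenset({"green", "breast"}),
    frozenset({"green", "wing"}),
]


def decompose(seen):
    """Intersect every pair of seen attributes; a single shared
    element is treated as a basic attribute."""
    bases = set()
    for a, b in combinations(seen, 2):
        shared = a & b
        if len(shared) == 1:
            bases |= shared
    return bases


def recombine(bases, seen):
    """Union each recovered color with each recovered part and keep
    only the combinations that are not already seen."""
    colors = bases & COLORS
    parts = bases & PARTS
    candidates = {frozenset({c}) | frozenset({p}) for c, p in product(colors, parts)}
    return candidates - set(seen)


bases = decompose(SEEN)
novel = recombine(bases, SEEN)

# Print the novel attributes in a stable order.
for attr in sorted(tuple(sorted(a)) for a in novel):
    print(attr)
```

In this toy setting, six seen attributes are enough to recover all six basic attributes and to name the three remaining color/part combinations. The real method has to learn what intersection and union mean for detectors rather than for symbols.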

updated: Sun Nov 28 2021 15:45:54 GMT+0000 (UTC)

published: Sun Nov 28 2021 15:45:54 GMT+0000 (UTC)

arXiv
