Learning Graph Embeddings for Compositional Zero-shot Learning

Muhammad Ferjad Naeem; Yongqin Xian; Federico Tombari; Zeynep Akata

構成的ゼロショット学習のためのグラフ埋め込みの学習

構図ゼロショット学習の目標は、トレーニングセット内の観察された視覚的プリミティブ状態（例：古くてかわいい）とオブジェクト（例：車、犬）の見えない構図（例：老犬）を認識することです。同じ状態でも、たとえば犬の外観が車とは大幅に異なる可能性があるため、これは困難です。解決策として、画像の特徴、構成分類子、および視覚的プリミティブの潜在的表現をエンドツーエンドで学習する、Compositional Graph Embedding（CGE）と呼ばれる新しいグラフ定式化を提案します。私たちのアプローチの鍵は、グラフ構造内の状態、オブジェクト、およびそれらの構成間の依存関係を活用して、表示されている構成から表示されていない構成への関連する知識の伝達を強制することです。概念間のセマンティクスをエンコードする共同互換性を学習することにより、私たちのモデルは、WordNetのような外部の知識ベースに依存することなく、目に見えない構成への一般化を可能にします。挑戦的な一般化された構成上のゼロショット設定では、CGEがMIT-StatesとUT-Zapposの最先端を大幅に上回っていることを示しています。また、最近のGQAデータセットに基づいて、このタスクの新しいベンチマークを提案します。コードはhttps://github.com/ExplainableML/czslで入手できます。

In compositional zero-shot learning, the goal is to recognize unseen compositions (e.g. old dog) of observed visual primitives states (e.g. old, cute) and objects (e.g. car, dog) in the training set. This is challenging because the same state can for example alter the visual appearance of a dog drastically differently from a car. As a solution, we propose a novel graph formulation called Compositional Graph Embedding (CGE) that learns image features, compositional classifiers, and latent representations of visual primitives in an end-to-end manner. The key to our approach is exploiting the dependency between states, objects, and their compositions within a graph structure to enforce the relevant knowledge transfer from seen to unseen compositions. By learning a joint compatibility that encodes semantics between concepts, our model allows for generalization to unseen compositions without relying on an external knowledge base like WordNet. We show that in the challenging generalized compositional zero-shot setting our CGE significantly outperforms the state of the art on MIT-States and UT-Zappos. We also propose a new benchmark for this task based on the recent GQA dataset. Code is available at: https://github.com/ExplainableML/czsl

updated: Mon May 03 2021 19:12:00 GMT+0000 (UTC)

published: Wed Feb 03 2021 10:11:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト