Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning

Tasfia Shermin; Shyh Wei Teng; Ferdous Sohel; Manzur Murshed; Guojun Lu

一般化されたゼロショット学習のための双方向マッピング結合GAN

双方向マッピングベースの一般化ゼロショット学習（GZSL）メソッドは、合成された機能の品質に依存して、表示されたデータと表示されていないデータを認識します。したがって、これらの方法では、見えないドメインの同時分布を学習し、ドメインの区別を維持することが重要です。ただし、GZSL問題設定では、見えないクラスのセマンティクスを使用できますが、既存のメソッドは、見えたデータの基本的な分布のみを学習します。ほとんどの方法は、ドメインの区別を保持することを無視し、学習した分布を使用して、表示されたデータと表示されていないデータを認識します。その結果、それらはうまく機能しません。この作業では、目に見えるクラスのセマンティクスと一緒に利用可能な目に見えないクラスのセマンティクスを利用し、強力な視覚的セマンティック結合を通じて同時分布を学習します。結合された生成的敵対的ネットワークをデュアルドメイン学習双方向マッピングモデルに拡張することにより、双方向マッピング結合された生成的敵対的ネットワーク（BMCoGAN）を提案します。さらに、Wasserstein生成的敵対的最適化を統合して、同時分布学習を監視します。合成された特徴にドメイン固有の情報を保持し、見られるクラスへのバイアスを減らすための損失最適化を設計します。これにより、合成された見られる特徴が実際に見られる特徴に向かってプッシュされ、合成された見えない特徴が実際に見られる特徴から引き離されます。ベンチマークデータセットでBMCoGANを評価し、最新の方法に対して優れたパフォーマンスを発揮します。

Bidirectional mapping-based generalized zero-shot learning (GZSL) methods rely on the quality of synthesized features to recognize seen and unseen data. Therefore, learning a joint distribution of seen-unseen domains and preserving domain distinction is crucial for these methods. However, existing methods only learn the underlying distribution of seen data, although unseen class semantics are available in the GZSL problem setting. Most methods neglect retaining domain distinction and use the learned distribution to recognize seen and unseen data. Consequently, they do not perform well. In this work, we utilize the available unseen class semantics alongside seen class semantics and learn joint distribution through a strong visual-semantic coupling. We propose a bidirectional mapping coupled generative adversarial network (BMCoGAN) by extending the coupled generative adversarial network into a dual-domain learning bidirectional mapping model. We further integrate a Wasserstein generative adversarial optimization to supervise the joint distribution learning. We design a loss optimization for retaining domain distinctive information in the synthesized features and reducing bias towards seen classes, which pushes synthesized seen features towards real seen features and pulls synthesized unseen features away from real seen features. We evaluate BMCoGAN on benchmark datasets and demonstrate its superior performance against contemporary methods.

updated: Fri Feb 19 2021 08:25:09 GMT+0000 (UTC)

published: Wed Dec 30 2020 06:11:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト