Omni-GAN: On the Secrets of cGANs and Beyond

Peng Zhou; Lingxi Xie; Bingbing Ni; Qi Tian

Designing a proper discriminator for conditional generative adversarial networks (cGANs) has been an important problem. In this paper, we investigate two popular choices, the projection-based and classification-based discriminators, and reveal that both suffer from drawbacks that affect the learning ability of cGANs. We then present a solution that trains a powerful discriminator and avoids over-fitting through regularization. In addition, we unify multiple targets (class, domain, reality, etc.) into one loss function to enable a wider range of applications. Our algorithm, named Omni-GAN, is a simple modification that significantly improves the performance of projection-based cGANs and achieves a new state of the art in generating mid/high-resolution images (a record-breaking IS of 190.9 on ImageNet 128×128). More importantly, we explain experimentally why Omni-GAN is significantly better than BigGAN, a projection-based cGAN, offering new possible directions for optimizing cGANs. Code is available at https://github.com/PeterouZh/Omni-GAN-PyTorch.
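For readers unfamiliar with the two discriminator families named in the abstract, the following is a minimal sketch of how each typically turns an image feature vector into an output. It uses generic formulations from the cGAN literature and is not taken from the Omni-GAN repository. The function names, the toy weights, and the two-dimensional features are all assumptions made for illustration, and the sketch does not attempt to reproduce the unified loss that the paper proposes.

```python
import math

def projection_logit(features, class_embeddings, w, b, label):
    """Projection-style output: one real/fake logit conditioned on the label.

    logit = w . phi(x) + b + e_y . phi(x), where e_y is the label embedding.
    """
    unconditional = sum(wi * fi for wi, fi in zip(w, features)) + b
    conditional = sum(ei * fi for ei, fi in zip(class_embeddings[label], features))
    return unconditional + conditional

def classification_logits(features, class_weights):
    """Classification-style output: one logit per class from a linear head."""
    return [sum(wi * fi for wi, fi in zip(row, features)) for row in class_weights]

def softmax_cross_entropy(logits, label):
    """Numerically stable cross-entropy for a single example."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

# Toy inputs (hypothetical values, chosen only to make the arithmetic easy to follow).
features = [1.0, 2.0]                        # phi(x): feature vector of one image
class_embeddings = [[1.0, 0.0], [0.0, 1.0]]  # one embedding (or weight row) per class
w, b = [0.5, -0.5], 0.0                      # unconditional real/fake head

proj = projection_logit(features, class_embeddings, w, b, label=1)
cls = classification_logits(features, class_embeddings)
loss = softmax_cross_entropy(cls, label=1)
print(proj, cls, loss)
```

The projection-style discriminator yields a single scalar whose class dependence enters only through the inner product with the label embedding, whereas the classification-style discriminator is supervised on every class logit at once. This difference in supervision is what the abstract refers to when it contrasts the two designs.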

updated: Fri Feb 19 2021 05:33:05 GMT+0000 (UTC)

published: Thu Nov 26 2020 00:30:20 GMT+0000 (UTC)

arXiv
