GMM-Based Generative Adversarial Encoder Learning

Yuri Feigin; Hedva Spitzer; Raja Giryes

GMMベースの生成的敵対的エンコーダ学習

GANは画像を生成するための強力なモデルですが、潜在空間を推測できないため、エンコーダーを必要とするアプリケーションでの使用が直接制限されます。私たちの論文は、GANの生成機能とエンコーダーを組み合わせた単純なアーキテクチャのセットアップを示しています。これは、共有の重みを使用してエンコーダーとディスクリミネーターを組み合わせ、新しい損失項を使用してそれらを同時にトレーニングすることで実現します。 GMMを介してエンコーダー潜在空間の出力をモデル化します。これにより、この潜在空間を使用した優れたクラスタリングと、GANによる画像生成の改善の両方が実現します。私たちのフレームワークは一般的であり、GAN戦略に簡単に組み込むことができます。特に、VanillaGANとWassersteinGANの両方でそれを示します。どちらの場合も、ISスコアとFIDスコアの両方の観点から生成された画像の改善につながります。さらに、そのクラスタリング結果が現在のGANベースの最先端のクラスタリングと競合するため、エンコーダが意味のある表現を学習することを示します。

While GAN is a powerful model for generating images, its inability to infer a latent space directly limits its use in applications requiring an encoder. Our paper presents a simple architectural setup that combines the generative capabilities of GAN with an encoder. We accomplish this by combining the encoder with the discriminator using shared weights, then training them simultaneously using a new loss term. We model the output of the encoder latent space via a GMM, which leads to both good clustering using this latent space and improved image generation by the GAN. Our framework is generic and can be easily plugged into any GAN strategy. In particular, we demonstrate it both with Vanilla GAN and Wasserstein GAN, where in both it leads to an improvement in the generated images in terms of both the IS and FID scores. Moreover, we show that our encoder learns a meaningful representation as its clustering results are competitive with the current GAN-based state-of-the-art in clustering.

updated: Tue Dec 08 2020 16:12:16 GMT+0000 (UTC)

published: Tue Dec 08 2020 16:12:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト