Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

Arda Sahiner; Tolga Ergen; Batu Ozturkler; Burak Bartan; John Pauly; Morteza Mardani; Mert Pilanci



Generative Adversarial Networks (GANs) are commonly used for modeling complex distributions of data. Both the generators and discriminators of GANs are often modeled by neural networks, posing a non-transparent optimization problem which is non-convex and non-concave over the generator and discriminator, respectively. Such networks are often heuristically optimized with gradient descent-ascent (GDA), but it is unclear whether the optimization problem contains any saddle points, or whether heuristic methods can find them in practice. In this work, we analyze the training of Wasserstein GANs with two-layer neural network discriminators through the lens of convex duality, and for a variety of generators expose the conditions under which Wasserstein GANs can be solved exactly with convex optimization approaches, or can be represented as convex-concave games. Using this convex duality interpretation, we further demonstrate the impact of different activation functions of the discriminator. Our observations are verified with numerical results demonstrating the power of the convex interpretation, with applications in progressive training of convex architectures corresponding to linear generators and quadratic-activation discriminators for CelebA image generation. The code for our experiments is available at https://github.com/ardasahiner/ProCoGAN.
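As a small illustration of why it matters whether gradient descent-ascent can find a saddle point, the sketch below runs simultaneous GDA on the toy bilinear game f(x, y) = x * y, whose only saddle point is (0, 0). This game is an assumption chosen for illustration and is not taken from the paper; the function name `simultaneous_gda` is hypothetical. On this game each simultaneous update multiplies the distance from the saddle point by sqrt(1 + step^2), so the iterates spiral outward instead of converging.

```python
import math


def simultaneous_gda(x, y, step, iters):
    """Simultaneous gradient descent-ascent on the toy game f(x, y) = x * y.

    x is the minimizing player, y is the maximizing player.
    """
    for _ in range(iters):
        grad_x, grad_y = y, x  # df/dx = y, df/dy = x
        # Both players update from the same (old) iterate.
        x, y = x - step * grad_x, y + step * grad_y
    return x, y


x0, y0 = 1.0, 1.0
step, iters = 0.1, 100
x, y = simultaneous_gda(x0, y0, step, iters)

start_norm = math.hypot(x0, y0)
end_norm = math.hypot(x, y)
growth = end_norm / start_norm  # equals (1 + step**2) ** (iters / 2) up to rounding
print(f"distance from saddle point grew by a factor of {growth:.3f}")
```

The same convex-concave structure that makes this toy game easy to analyze is what the abstract refers to when it asks under which conditions a Wasserstein GAN can be posed as a convex-concave game or solved directly by convex optimization.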

updated: Mon Jul 12 2021 18:33:49 GMT+0000 (UTC)

published: Mon Jul 12 2021 18:33:49 GMT+0000 (UTC)

arXiv

