Projected GANs Converge Faster

Axel Sauer; Kashyap Chitta; Jens Müller; Andreas Geiger

予測されたGANはより速く収束します

生成的敵対的ネットワーク（GAN）は高品質の画像を生成しますが、トレーニングは困難です。注意深い正則化、大量の計算、および高価なハイパーパラメータスイープが必要です。生成された実際のサンプルを、事前にトレーニングされた固定の特徴空間に投影することで、これらの問題を大幅に前進させます。弁別器が事前にトレーニングされたモデルのより深い層からの機能を完全に活用できないという発見に動機付けられて、チャネルと解像度全体で機能を混合するより効果的な戦略を提案します。 Projected GANは、画質、サンプル効率、収束速度を向上させます。さらに、最大1メガピクセルの解像度と互換性があり、22のベンチマークデータセットで最先端のフレシェ開始距離（FID）を向上させます。重要なのは、予測されたGANが以前の最低のFIDと最大40倍速く一致し、同じ計算リソースが与えられた場合、実時間を5日から3時間未満に短縮することです。

Generative Adversarial Networks (GANs) produce high-quality images but are challenging to train. They need careful regularization, vast amounts of compute, and expensive hyper-parameter sweeps. We make significant headway on these issues by projecting generated and real samples into a fixed, pretrained feature space. Motivated by the finding that the discriminator cannot fully exploit features from deeper layers of the pretrained model, we propose a more effective strategy that mixes features across channels and resolutions. Our Projected GAN improves image quality, sample efficiency, and convergence speed. It is further compatible with resolutions of up to one Megapixel and advances the state-of-the-art Fréchet Inception Distance (FID) on twenty-two benchmark datasets. Importantly, Projected GANs match the previously lowest FIDs up to 40 times faster, cutting the wall-clock time from 5 days to less than 3 hours given the same computational resources.

updated: Mon Nov 01 2021 15:11:01 GMT+0000 (UTC)

published: Mon Nov 01 2021 15:11:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト