Self Sparse Generative Adversarial Networks

Wenliang Qian; Yang Xu; Wangmeng Zuo; Hui Li

Generative Adversarial Networks (GANs) are unsupervised generative models that learn a data distribution through adversarial training. However, recent experiments have indicated that GANs are difficult to train because they require optimization in a high-dimensional parameter space and suffer from the zero gradient problem. In this work, we propose a Self Sparse Generative Adversarial Network (Self-Sparse GAN) that reduces the parameter space and alleviates the zero gradient problem. In the Self-Sparse GAN, we design a Self-Adaptive Sparse Transform Module (SASTM), comprising sparsity decomposition and feature-map recombination, which can be applied to multi-channel feature maps to obtain sparse feature maps. The key idea of Self-Sparse GAN is to add the SASTM after every deconvolution layer in the generator, which adaptively reduces the parameter space by exploiting the sparsity in multi-channel feature maps. We theoretically prove that the SASTM not only reduces the search space of the generator's convolution kernel weights but also alleviates the zero gradient problem by maintaining meaningful features in the Batch Normalization layer and driving the weights of the deconvolution layers away from being negative. Experimental results show that our method achieves better FID scores for image generation than WGAN-GP on MNIST, Fashion-MNIST, CIFAR-10, STL-10, mini-ImageNet, CELEBA-HQ, and LSUN bedrooms, with a relative decrease in FID of 4.76% to 21.84%.
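
As a rough illustration of how sparsity in multi-channel feature maps can shrink the effective parameter space, the sketch below gates each channel with a ReLU-thresholded coefficient. It is not the SASTM defined in the paper, whose sparsity decomposition and feature-map recombination are not detailed in this abstract. The names `channel_gate` and `active_channels`, the nested-list feature-map layout, and all numeric values are hypothetical and chosen only for illustration.

```python
from typing import List

# One channel is an H x W grid of activations (assumed layout, not from the paper).
FeatureMap = List[List[float]]


def relu(x: float) -> float:
    """Standard rectifier: negative inputs map to zero."""
    return x if x > 0.0 else 0.0


def channel_gate(feature_maps: List[FeatureMap], coeffs: List[float]) -> List[FeatureMap]:
    """Scale each channel by relu(coeff); channels with coeff <= 0 become all zeros.

    Hypothetical stand-in for a sparsifying transform, not the paper's SASTM.
    """
    if len(feature_maps) != len(coeffs):
        raise ValueError("need exactly one coefficient per channel")
    gated = []
    for fmap, coeff in zip(feature_maps, coeffs):
        gate = relu(coeff)
        gated.append([[gate * value for value in row] for row in fmap])
    return gated


def active_channels(coeffs: List[float]) -> List[int]:
    """Indices of channels whose gate is non-zero."""
    return [i for i, coeff in enumerate(coeffs) if coeff > 0.0]


# Three 1x2 channels with made-up activations and coefficients.
maps = [[[1.0, 2.0]], [[3.0, 4.0]], [[5.0, 6.0]]]
coeffs = [0.5, -1.0, 2.0]
gated = channel_gate(maps, coeffs)
print(active_channels(coeffs))  # [0, 2]
```

Under this assumed gating, a channel whose gate is zero contributes nothing downstream, so the kernel that produces it no longer influences the output. That is one simple sense in which sparse feature maps reduce the number of weights that matter.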

updated: Tue Jan 26 2021 04:49:12 GMT+0000 (UTC)

published: Tue Jan 26 2021 04:49:12 GMT+0000 (UTC)

arXiv
