Improving GAN Training via Feature Space Shrinkage

Haozhe Liu; Wentian Zhang; Bing Li; Haoqian Wu; Nanjun He; Yawen Huang; Yuexiang Li; Bernard Ghanem; Yefeng Zheng

Due to the outstanding capability for data generation, Generative Adversarial Networks (GANs) have attracted considerable attention in unsupervised learning. However, training GANs is difficult, since the training distribution is dynamic for the discriminator, leading to unstable image representation. In this paper, we address the problem of training GANs from a novel perspective, i.e., robust image classification. Motivated by studies on robust image representation, we propose a simple yet effective module, namely AdaptiveMix, for GANs, which shrinks the regions of training data in the image representation space of the discriminator. Considering it is intractable to directly bound feature space, we propose to construct hard samples and narrow down the feature distance between hard and easy samples. The hard samples are constructed by mixing a pair of training images. We evaluate the effectiveness of our AdaptiveMix with widely-used and state-of-the-art GAN architectures. The evaluation results demonstrate that our AdaptiveMix can facilitate the training of GANs and effectively improve the image quality of generated samples. We also show that our AdaptiveMix can be further applied to image classification and Out-Of-Distribution (OOD) detection tasks, by equipping it with state-of-the-art methods. Extensive experiments on seven publicly available datasets show that our method effectively boosts the performance of baselines. The code is publicly available at https://github.com/WentianZhang-ML/AdaptiveMix.
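The idea described in the abstract (build a hard sample by mixing a pair of training images, then pull its feature toward those of the easy samples) can be illustrated with a small sketch. This is not the authors' implementation, and the abstract does not state the exact loss. The sketch assumes one plausible reading: the feature of the mixed image is compared with the same convex combination of the two original features. The names `mix_pair`, `features`, `shrinkage_loss`, and the toy linear-plus-ReLU feature extractor are all hypothetical stand-ins for the discriminator's representation.

```python
import random


def mix_pair(x_a, x_b, lam):
    """Convex combination of two equal-length vectors (e.g. flattened images)."""
    return [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]


def features(x, weights):
    """Hypothetical feature extractor: one linear layer followed by ReLU.

    Stands in for the discriminator's image representation; the real
    network is not described in the abstract.
    """
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


def shrinkage_loss(x_a, x_b, lam, weights):
    """Squared distance between the hard sample's feature and the
    matching mix of the two easy samples' features (assumed loss form)."""
    f_hard = features(mix_pair(x_a, x_b, lam), weights)
    f_easy = mix_pair(features(x_a, weights), features(x_b, weights), lam)
    return sum((h - e) ** 2 for h, e in zip(f_hard, f_easy))


# Toy data: two random 8-pixel "images" and a random 4x8 weight matrix.
rng = random.Random(0)
x_a = [rng.random() for _ in range(8)]
x_b = [rng.random() for _ in range(8)]
weights = [[rng.uniform(-1.0, 1.0) for _ in range(8)] for _ in range(4)]
lam = rng.random()  # mixing coefficient in [0, 1)

loss = shrinkage_loss(x_a, x_b, lam, weights)
print("shrinkage loss:", loss)
```

In a training loop, a term of this kind would be added to the discriminator's objective so that minimizing it narrows the feature distance between hard and easy samples. For the actual formulation, see the repository linked in the abstract.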

updated: Thu Mar 02 2023 20:22:24 GMT+0000 (UTC)

published: Thu Mar 02 2023 20:22:24 GMT+0000 (UTC)

arXiv

