Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy

Zhengwei Wang; Qi She; Tomas E. Ward

コンピュータビジョンにおける生成的敵対的ネットワーク：調査と分類

生成的敵対的ネットワーク（GAN）は、過去数年間に広範囲にわたって研究されてきました。間違いなく、それらの最も重要な影響は、もっともらしい画像生成、画像から画像への変換、顔の属性操作および類似のドメインなどの課題で大きな進歩があったコンピュータービジョンの領域にあります。今日までに達成された重要な成功にもかかわらず、GANを現実世界の問題に適用することは依然として重大な課題を引き起こします。これらは、（1）高品質の画像の生成、（2）画像生成の多様性、（3）安定したトレーニングです。一般的なGANテクノロジがこれらの課題に対してどの程度進歩しているかに焦点を当てて、公開された科学文献でGAN関連の研究における最新技術の詳細なレビューを提供します。 GANアーキテクチャと損失関数のバリエーションに基づいて採用した便利な分類法により、このレビューをさらに構造化します。これまでにGANに関するいくつかのレビューが提示されていますが、コンピュータービジョンに関連する実際的な課題への取り組みの進展に基づいて、このフィールドのステータスを検討したものはありません。したがって、これらの課題に取り組むために、最も一般的なアーキテクチャバリアントおよび損失バリアントGANを確認し、批判的に議論します。私たちの目的は、コンピュータビジョンアプリケーションの重要な要件への関連する進捗状況に関するGAN研究のステータスの概要と重要な分析を提供することです。これを行う際に、GANがかなりの成功を収めているコンピュータービジョンの最も説得力のあるアプリケーションと、将来の研究の方向性に関するいくつかの提案についても説明します。この研究で研究されたGANバリアントに関連するコードは、https：//github.com/sheqi/GAN_Reviewにまとめられています。

Generative adversarial networks (GANs) have been extensively studied in the past few years. Arguably their most significant impact has been in the area of computer vision where great advances have been made in challenges such as plausible image generation, image-to-image translation, facial attribute manipulation and similar domains. Despite the significant successes achieved to date, applying GANs to real-world problems still poses significant challenges, three of which we focus on here. These are: (1) the generation of high quality images, (2) diversity of image generation, and (3) stable training. Focusing on the degree to which popular GAN technologies have made progress against these challenges, we provide a detailed review of the state of the art in GAN-related research in the published scientific literature. We further structure this review through a convenient taxonomy we have adopted based on variations in GAN architectures and loss functions. While several reviews for GANs have been presented to date, none have considered the status of this field based on their progress towards addressing practical challenges relevant to computer vision. Accordingly, we review and critically discuss the most popular architecture-variant, and loss-variant GANs, for tackling these challenges. Our objective is to provide an overview as well as a critical analysis of the status of GAN research in terms of relevant progress towards important computer vision application requirements. As we do this we also discuss the most compelling applications in computer vision in which GANs have demonstrated considerable success along with some suggestions for future research directions. Code related to GAN-variants studied in this work is summarized on https://github.com/sheqi/GAN_Review.

updated: Tue Dec 29 2020 11:49:06 GMT+0000 (UTC)

published: Tue Jun 04 2019 15:40:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト