Generating a Fusion Image: One's Identity and Another's Shape

Donggyu Joo; Doyeon Kim; Junmo Kim

融合画像の生成：ある人のアイデンティティと別の人の形

2つの入力画像を操作して新しい画像を生成することは、生成的敵対的ネットワーク（GAN）の研究における興味深い研究課題です。入力画像xのアイデンティティと入力画像yの形状を持つ融合画像を生成する新しいGANベースのネットワークを提案します。私たちのネットワークは、教師なしの方法で3つ以上の画像データセットを同時にトレーニングできます。画像xのアイデンティティをキャッチするためのアイデンティティ損失LIと、yの形状を取得するための形状損失LSを定義します。さらに、ジェネレータを画像全体ではなく重要な部分に集中させるための、ミンパッチトレーニングと呼ばれる新しいトレーニング方法を提案します。 VGG Youtube Poseデータセット、Eyeデータセット（MPIIGazeおよびUnityEyes）、およびPhoto-Sketch-Cartoonデータセットで定性的な結果を示します。

Generating a novel image by manipulating two input images is an interesting research problem in the study of generative adversarial networks (GANs). We propose a new GAN-based network that generates a fusion image with the identity of input image x and the shape of input image y. Our network can simultaneously train on more than two image datasets in an unsupervised manner. We define an identity loss LI to catch the identity of image x and a shape loss LS to get the shape of y. In addition, we propose a novel training method called Min-Patch training to focus the generator on crucial parts of an image, rather than its entirety. We show qualitative results on the VGG Youtube Pose dataset, Eye dataset (MPIIGaze and UnityEyes), and the Photo-Sketch-Cartoon dataset.

updated: Wed Jan 26 2022 03:09:45 GMT+0000 (UTC)

published: Fri Apr 20 2018 06:00:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト