Surrogate Gradient Field for Latent Space Manipulation

Minjun Li; Yanghua Jin; Huachun Zhu

潜在空間操作のための代理勾配場

生成的敵対的ネットワーク（GAN）は、サンプリングされた潜在コードから高品質の画像を生成できます。最近の作品は、基礎となる潜在コードを操作することによって画像を編集しようとしますが、属性調整の基本的なタスクを超えることはめったにありません。キーポイントやキャプションなどの多次元条件での操作を可能にする最初の方法を提案します。具体的には、補助マッピングネットワークによって誘導される代理勾配フィールド（SGF）に基づいて、ターゲット条件を満たす新しい潜在コードを検索するアルゴリズムを設計します。定量的な比較のために、操作方法の解きほぐしを評価するためのメトリックを提案します。顔の属性調整タスクに関する徹底的な実験的分析は、私たちの方法が解きほぐしにおいて最先端の方法よりも優れていることを示しています。さらに、さまざまな条件モダリティのタスクにこの方法を適用して、この方法がキーポイントやキャプションなどの複雑な画像プロパティを変更できることを示します。

Generative adversarial networks (GANs) can generate high-quality images from sampled latent codes. Recent works attempt to edit an image by manipulating its underlying latent code, but rarely go beyond the basic task of attribute adjustment. We propose the first method that enables manipulation with multidimensional condition such as keypoints and captions. Specifically, we design an algorithm that searches for a new latent code that satisfies the target condition based on the Surrogate Gradient Field (SGF) induced by an auxiliary mapping network. For quantitative comparison, we propose a metric to evaluate the disentanglement of manipulation methods. Thorough experimental analysis on the facial attribute adjustment task shows that our method outperforms state-of-the-art methods in disentanglement. We further apply our method to tasks of various condition modalities to demonstrate that our method can alter complex image properties such as keypoints and captions.

updated: Mon Apr 19 2021 06:15:06 GMT+0000 (UTC)

published: Mon Apr 19 2021 06:15:06 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト