Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance

Jingjing Chen; Jichao Zhang; Enver Sangineto; Jiayuan Fan; Tao Chen; Nicu Sebe

数値的および絵画的ガイダンスによる粗視線から微細視線へのリダイレクト

視線リダイレクトは、特定の顔画像の視線を目的の方向（つまり、参照角度）に対して操作することを目的としており、ビデオ会議や集合写真の撮影など、多くの実際のシナリオに適用できます。ただし、このトピックに関するこれまでの作業には、主に2つの制限があります。（1）低品質の画像生成と（2）低いリダイレクト精度です。この論文では、粗いものから細かいものへの学習戦略と組み合わせて、数値と画像の両方の方向ガイダンスを活用する新しい視線リダイレクトフレームワークによってこれらの問題を軽減することを提案します。具体的には、粗い枝は、希望する視線に従って入力画像を歪める空間変換を学習します。一方、きめ細かいブランチは、条件付き残差画像学習とマルチタスク弁別器を備えたジェネレータネットワークで構成されます。この2番目のブランチは、以前にワープされた画像とグラウンドトゥルース画像の間のギャップを減らし、より細かいテクスチャの詳細を復元します。さらに、視線リダイレクトの精度をさらに向上させるための追加ガイドとして、絵の視線マップの説明と数値の角度を使用する数値および絵のガイダンスモジュール〜（NPG）を提案します。ベンチマークデータセットでの広範な実験は、提案された方法が、画質とリダイレクト精度の両方の点で最先端のアプローチよりも優れていることを示しています。コードはhttps://github.com/jingjingchen777/CFGRで入手できます。

Gaze redirection aims at manipulating the gaze of a given face image with respect to a desired direction (i.e., a reference angle) and it can be applied to many real life scenarios, such as video-conferencing or taking group photos. However, previous work on this topic mainly suffers of two limitations: (1) Low-quality image generation and (2) Low redirection precision. In this paper, we propose to alleviate these problems by means of a novel gaze redirection framework which exploits both a numerical and a pictorial direction guidance, jointly with a coarse-to-fine learning strategy. Specifically, the coarse branch learns the spatial transformation which warps input image according to desired gaze. On the other hand, the fine-grained branch consists of a generator network with conditional residual image learning and a multi-task discriminator. This second branch reduces the gap between the previously warped image and the ground-truth image and recovers finer texture details. Moreover, we propose a numerical and pictorial guidance module~(NPG) which uses a pictorial gazemap description and numerical angles as an extra guide to further improve the precision of gaze redirection. Extensive experiments on a benchmark dataset show that the proposed method outperforms the state-of-the-art approaches in terms of both image quality and redirection precision. The code is available at https://github.com/jingjingchen777/CFGR

updated: Thu Nov 26 2020 06:17:15 GMT+0000 (UTC)

published: Tue Apr 07 2020 01:17:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト