Unsupervised High-Fidelity Facial Texture Generation and Reconstruction

Ron Slossberg; Ibrahim Jubran; Ron Kimmel

教師なしの忠実度の高い顔のテクスチャの生成と再構築

単一の画像からの顔の3Dジオメトリとテクスチャの回復のタスクに取り組むために、長年にわたって多くの方法が提案されてきました。このような方法では、トレーニング中に3D顔スキャンに依存せずに、忠実度の高いテクスチャを提供できないことがよくあります。対照的に、3D顔生成の補完的なタスクはそれほど注目されていません。 GANが非常にリアルな顔画像を生成することが証明されている2Dテクスチャドメインとは対照的に、より挑戦的な3Dジオメトリドメインは、同じレベルのリアリズムと多様性にまだ追いついていない。この論文では、両方のタスク、ジオメトリとテクスチャの両方の生成、および忠実度の高いテクスチャの復元のための新しい統合パイプラインを提案します。私たちのテクスチャモデルは、スキャンされたテクスチャマップとは対照的に、教師なしの方法で自然画像から学習されます。私たちの知る限り、これはスキャンされたテクスチャに依存しない最初のそのような統合フレームワークです。私たちの新しいトレーニングパイプラインには、事前にトレーニングされた2D顔射ジェネレーターと深い特徴操作方法が組み込まれています。正確な3DMMフィッティングを適用することで、モデル化されたテクスチャを合成的に生成された背景画像にシームレスに統合し、背景、髪、歯、体を含むテクスチャモデルのリアルな構成を形成できます。これにより、2D画像生成のドメインからの転移学習を適用できるため、このドメインで得られた印象的な結果から大きな恩恵を受けることができます。生成タスクと再構築タスクでモデルを比較するいくつかの最近の方法に関する包括的な研究を提供します。広範な定性分析と定量分析が示すように、両方のタスクで最先端の結果を達成しています。

Many methods have been proposed over the years to tackle the task of facial 3D geometry and texture recovery from a single image. Such methods often fail to provide high-fidelity texture without relying on 3D facial scans during training. In contrast, the complementary task of 3D facial generation has not received as much attention. As opposed to the 2D texture domain, where GANs have proven to produce highly realistic facial images, the more challenging 3D geometry domain has not yet caught up to the same levels of realism and diversity. In this paper, we propose a novel unified pipeline for both tasks, generation of both geometry and texture, and recovery of high-fidelity texture. Our texture model is learned, in an unsupervised fashion, from natural images as opposed to scanned texture maps. To the best of our knowledge, this is the first such unified framework independent of scanned textures. Our novel training pipeline incorporates a pre-trained 2D facial generator coupled with a deep feature manipulation methodology. By applying precise 3DMM fitting, we can seamlessly integrate our modeled textures into synthetically generated background images forming a realistic composition of our textured model with background, hair, teeth, and body. This enables us to apply transfer learning from the domain of 2D image generation, thus, benefiting greatly from the impressive results obtained in this domain. We provide a comprehensive study on several recent methods comparing our model in generation and reconstruction tasks. As the extensive qualitative, as well as quantitative analysis, demonstrate, we achieve state-of-the-art results for both tasks.

updated: Sun Oct 10 2021 10:59:04 GMT+0000 (UTC)

published: Sun Oct 10 2021 10:59:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト