High-Fidelity and Freely Controllable Talking Head Video Generation

Yue Gao; Yuan Zhou; Jinglu Wang; Xiao Li; Xiang Ming; Yan Lu

高忠実度で自由に制御可能なトーキングヘッドビデオ生成

トーキングヘッド生成とは、特定のソース ID とターゲットモーションに基づいてビデオを生成することです。ただし、現在の方法は、生成されたビデオの品質と制御性を制限するいくつかの課題に直面しています。第 1 に、生成された面には予期しない変形や深刻な歪みが生じることがよくあります。第二に、運転中の画像はポーズや表情などの動きに関連する情報を明示的に解きほぐさないため、生成中のさまざまな属性の操作が制限されます。第 3 に、生成されたビデオには、隣接するフレーム間で抽出されたランドマークの不一致が原因で、ちらつきのアーティファクトが含まれる傾向があります。この論文では、頭のポーズと表情を自由に制御して、忠実度の高いトーキングヘッドビデオを生成する新しいモデルを提案します。私たちの方法は、自己監視学習ランドマークと 3D 顔モデルベースのランドマークの両方を活用して、モーションをモデル化します。また、新しいモーション認識マルチスケール機能アライメントモジュールを導入して、顔の歪みなしでモーションを効果的に転送します。さらに、機能コンテキスト適応および伝播モジュールを使用して、合成されたトーキングヘッドビデオの滑らかさを強化します。困難なデータセットでモデルを評価し、その最先端のパフォーマンスを実証します。詳細については、https://yuegao.me/PECHead をご覧ください。

Talking head generation is to generate video based on a given source identity and target motion. However, current methods face several challenges that limit the quality and controllability of the generated videos. First, the generated face often has unexpected deformation and severe distortions. Second, the driving image does not explicitly disentangle movement-relevant information, such as poses and expressions, which restricts the manipulation of different attributes during generation. Third, the generated videos tend to have flickering artifacts due to the inconsistency of the extracted landmarks between adjacent frames. In this paper, we propose a novel model that produces high-fidelity talking head videos with free control over head pose and expression. Our method leverages both self-supervised learned landmarks and 3D face model-based landmarks to model the motion. We also introduce a novel motion-aware multi-scale feature alignment module to effectively transfer the motion without face distortion. Furthermore, we enhance the smoothness of the synthesized talking head videos with a feature context adaptation and propagation module. We evaluate our model on challenging datasets and demonstrate its state-of-the-art performance. More information is available at https://yuegao.me/PECHead.

updated: Thu Apr 20 2023 09:02:41 GMT+0000 (UTC)

published: Thu Apr 20 2023 09:02:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト