Autoregressive GAN for Semantic Unconditional Head Motion Generation

Louis Airale; Xavier Alameda-Pineda; Stéphane Lathuilière; Dominique Vaufreydaz

In this work, we address the task of unconditional head motion generation to animate still human faces in a low-dimensional semantic space from a single reference pose. Unlike traditional audio-conditioned talking head generation, which seldom emphasizes realistic head motion, we devise a GAN-based architecture that learns to synthesize rich head motion sequences over long durations while keeping error accumulation low. In particular, the autoregressive generation of incremental outputs ensures smooth trajectories, while a multi-scale discriminator on input pairs drives generation toward better handling of high- and low-frequency signals and less mode collapse. We experimentally demonstrate the relevance of the proposed method and show its superiority over models that attained state-of-the-art performance on similar tasks.
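The idea of autoregressive generation of incremental outputs can be illustrated with a toy sketch. This is not the authors' architecture: the learned generator is replaced by a hypothetical hand-written function (`toy_increment_model`), and the pose dimension, step bound, and function names are all assumptions made for illustration. The sketch only shows why predicting a bounded increment and adding it to the previous pose yields a trajectory without jumps.

```python
import math
import random


def toy_increment_model(history, rng, max_step=0.05):
    """Hypothetical stand-in for a learned generator (assumption, not from the paper).

    Looks at the last pose in the history and returns one bounded increment
    per pose dimension. tanh keeps every increment within [-max_step, max_step].
    """
    last_pose = history[-1]
    return [
        max_step * math.tanh(rng.gauss(0.0, 1.0) - 0.5 * value)
        for value in last_pose
    ]


def generate_autoregressive(reference_pose, num_frames, step_fn, seed=0):
    """Roll out a pose sequence from a single reference pose.

    At each step the model sees the poses generated so far and outputs an
    increment, which is added to the previous pose to form the next one.
    """
    rng = random.Random(seed)
    trajectory = [list(reference_pose)]
    for _ in range(num_frames - 1):
        delta = step_fn(trajectory, rng)
        previous = trajectory[-1]
        trajectory.append([p + d for p, d in zip(previous, delta)])
    return trajectory


# Usage: a 3-dimensional toy pose rolled out for 50 frames.
trajectory = generate_autoregressive([0.0, 0.0, 0.0], 50, toy_increment_model)
largest_step = max(
    abs(b - a)
    for prev, nxt in zip(trajectory, trajectory[1:])
    for a, b in zip(prev, nxt)
)
print(len(trajectory), largest_step)
```

Because each frame differs from the previous one by at most `max_step` per dimension, the rollout cannot jump abruptly. In a real model the increment would come from a trained network, and the bound would be learned behavior rather than a hard-coded constant.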

updated: Mon Apr 17 2023 09:45:22 GMT+0000 (UTC)

published: Wed Nov 02 2022 09:48:49 GMT+0000 (UTC)

arXiv
