Autoregressive GAN for Semantic Unconditional Head Motion Generation

Louis Airale; Xavier Alameda-Pineda; Stéphane Lathuilière; Dominique Vaufreydaz

セマンティック無条件頭部運動生成のための自己回帰 GAN

低次元のセマンティック空間で静止した人間の顔をアニメートするための無条件の頭の動きの生成のタスクに取り組みます。現実的な頭の動きにほとんど重点を置かないオーディオを条件とする話している頭の生成から逸脱して、豊富な取得を可能にする GAN ベースのアーキテクチャを考案します。 GAN に関連する既知の注意事項を回避しながら、頭の動きのシーケンス。実験的に提案されたアーキテクチャの関連性を調べ、同様のタスクで最先端のパフォーマンスを示したモデルと比較します。

We address the task of unconditional head motion generation to animate still human faces in a low-dimensional semantic space.Deviating from talking head generation conditioned on audio that seldom puts emphasis on realistic head motions, we devise a GAN-based architecture that allows obtaining rich head motion sequences while avoiding known caveats associated with GANs.Namely, the autoregressive generation of incremental outputs ensures smooth trajectories, while a multi-scale discriminator on input pairs drives generation toward better handling of high and low frequency signals and less mode collapse.We demonstrate experimentally the relevance of the proposed architecture and compare with models that showed state-of-the-art performances on similar tasks.

updated: Wed Nov 02 2022 09:48:49 GMT+0000 (UTC)

published: Wed Nov 02 2022 09:48:49 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト