AgeFlow: Conditional Age Progression and Regression with Normalizing Flows

Zhizhong Huang; Shouzhen Chen; Junping Zhang; Hongming Shan

AgeFlow：フローを正規化する条件付き年齢の進行と回帰

年齢の進行と回帰は、それぞれ老化と若返りの効果を持つ特定の顔画像の写実的な外観を合成することを目的としています。既存の生成的敵対的ネットワーク（GAN）ベースの方法には、次の3つの主要な問題があります：1）生成された顔に強いゴーストアーティファクトを導入する不安定なトレーニング、2）性別や人種などの顔の属性に予期しない変化をもたらす対になっていないトレーニング、3）全単射ではない年齢マッピングにより、顔の変形の不確実性が高まります。これらの問題を克服するために、このペーパーでは、フローベースのモデルとGANの両方の利点を統合するAgeFlowと呼ばれる新しいフレームワークを提案します。提案されたAgeFlowには、3つの部分が含まれています。可逆ニューラルネットワークを介して特定の面を潜在空間にマッピングするエンコーダー、ソース潜在ベクトルをターゲットベクトルに変換する新しい可逆条件付き変換モジュール（ICTM）、および生成された潜在ベクトルを再構築するデコーダーです。同じエンコーダネットワークを使用して、ターゲットの潜在ベクトルから面します。すべての部分は可逆的であり、全単射年齢マッピングを実現します。 ICTMの目新しさは2つあります。まず、属性を意識した知識蒸留を提案し、他の無関係な属性を変更せずに年齢の進行の操作方向を学習し、顔の属性の予期しない変化を軽減します。次に、潜在空間でGANを使用して、学習した潜在ベクトルを実際のベクトルと区別できないようにすることを提案します。これは、画像ドメインでのGANの従来の使用よりもはるかに簡単です。実験結果は、2つのベンチマークデータセットで既存のGANベースの方法よりも優れたパフォーマンスを示しています。ソースコードはhttps://github.com/Hzzone/AgeFlowで入手できます。

Age progression and regression aim to synthesize photorealistic appearance of a given face image with aging and rejuvenation effects, respectively. Existing generative adversarial networks (GANs) based methods suffer from the following three major issues: 1) unstable training introducing strong ghost artifacts in the generated faces, 2) unpaired training leading to unexpected changes in facial attributes such as genders and races, and 3) non-bijective age mappings increasing the uncertainty in the face transformation. To overcome these issues, this paper proposes a novel framework, termed AgeFlow, to integrate the advantages of both flow-based models and GANs. The proposed AgeFlow contains three parts: an encoder that maps a given face to a latent space through an invertible neural network, a novel invertible conditional translation module (ICTM) that translates the source latent vector to target one, and a decoder that reconstructs the generated face from the target latent vector using the same encoder network; all parts are invertible achieving bijective age mappings. The novelties of ICTM are two-fold. First, we propose an attribute-aware knowledge distillation to learn the manipulation direction of age progression while keeping other unrelated attributes unchanged, alleviating unexpected changes in facial attributes. Second, we propose to use GANs in the latent space to ensure the learned latent vector indistinguishable from the real ones, which is much easier than traditional use of GANs in the image domain. Experimental results demonstrate superior performance over existing GANs-based methods on two benchmarked datasets. The source code is available at https://github.com/Hzzone/AgeFlow.

updated: Sat May 15 2021 15:02:07 GMT+0000 (UTC)

published: Sat May 15 2021 15:02:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト