Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images

Snehal Singh Tomar; A. N. Rajagopalan

With a growing number of Augmented and Virtual Reality applications aiming to perform meaningful and controlled style edits on images of human faces, the impetus for parsing face images into accurate, fine-grained semantic segmentation maps is stronger than ever. The few state-of-the-art (SOTA) methods that address this problem do so by incorporating priors on facial structure, or on other face attributes such as expression and pose, into their deep classifier architectures. Our aim in this work is to do away with the priors and complex pre-processing operations required by SOTA multi-class face segmentation models. We do so by reframing segmentation as a downstream task, performed after disentanglement with respect to facial semantic regions of interest (ROIs) has been infused into the latent space of a Generative Autoencoder model. We present results for our model's performance on the CelebAMask-HQ and HELEN datasets. The encoded latent space of our model achieves significantly higher disentanglement with respect to semantic ROIs than that of other SOTA works. Moreover, on the downstream task of semantic segmentation of face images, it achieves a 13% faster inference rate and comparable accuracy relative to the publicly available SOTA.

updated: Tue Jul 05 2022 08:09:15 GMT+0000 (UTC)

published: Tue Jul 05 2022 08:09:15 GMT+0000 (UTC)

arXiv
