Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Hitika Tiwari; Min-Hung Chen; Yi-Min Tsai; Hsien-Kai Kuo; Hung-Jen Chen; Kevin Jou; K. S. Venkatesh; Yong-Sheng Chen

Despite recent developments in 3D face reconstruction from occluded and noisy face images, performance remains unsatisfactory. Moreover, most existing methods rely on additional dependencies, which impose numerous constraints on the training procedure. Therefore, we propose a Self-Supervised RObustifying GUidancE (ROGUE) framework to obtain robustness against occlusions and noise in face images. The proposed network contains 1) the Guidance Pipeline, which obtains the 3D face coefficients for clean faces, and 2) the Robustification Pipeline, which enforces consistency between the coefficients estimated for occluded or noisy images and those estimated for their clean counterparts. The proposed image- and feature-level loss functions aid the ROGUE learning process without introducing additional dependencies. To facilitate model evaluation, we propose two challenging occlusion face datasets, ReaChOcc and SynChOcc, which contain real-world and synthetic occlusion-based face images, respectively, for robustness evaluation. We also produce a noisy variant of the CelebA test set for evaluation. Our method outperforms the current state-of-the-art method by large margins (e.g., for perceptual error, a reduction of 23.8% for real-world occlusions, 26.4% for synthetic occlusions, and 22.7% for noisy images), demonstrating the effectiveness of the proposed approach. The occlusion datasets and the corresponding evaluation code are publicly available at https://github.com/ArcTrinity9/Datasets-ReaChOcc-and-SynChOcc.
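
The consistency idea behind the Robustification Pipeline can be illustrated with a minimal sketch. The abstract does not give the form of the loss, so the mean squared difference used here is an assumption chosen for illustration, not the authors' formulation; the function name `coefficient_consistency_loss` and the 4-dimensional example vectors are likewise hypothetical.

```python
from typing import Sequence


def coefficient_consistency_loss(
    clean_coeffs: Sequence[float],
    degraded_coeffs: Sequence[float],
) -> float:
    """Mean squared difference between two coefficient vectors.

    ASSUMPTION: the abstract only says the pipeline seeks consistency
    between coefficients for occluded/noisy and clean images; the squared
    error used here is an illustrative stand-in for that objective.
    """
    if len(clean_coeffs) != len(degraded_coeffs):
        raise ValueError("coefficient vectors must have the same length")
    if not clean_coeffs:
        raise ValueError("coefficient vectors must be non-empty")
    squared = [(c - d) ** 2 for c, d in zip(clean_coeffs, degraded_coeffs)]
    return sum(squared) / len(squared)


# Hypothetical coefficient vectors, for illustration only.
clean = [0.5, -1.0, 0.25, 2.0]      # as if estimated from a clean face
occluded = [0.5, -0.5, 0.25, 1.0]   # as if estimated from its occluded version
loss = coefficient_consistency_loss(clean, occluded)
```

In this sketch the loss is zero when both images yield identical coefficients and grows as the occluded or noisy estimate drifts from the clean one, which is the behaviour a consistency objective is meant to penalize.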

updated: Fri Oct 21 2022 04:38:14 GMT+0000 (UTC)

published: Wed Dec 29 2021 03:30:50 GMT+0000 (UTC)

arXiv
