Deep Active Shape Model for Face Alignment and Pose Estimation

Ali Pourramezan Fard; Hojjat Abdollahi; Mohammad Mahoor

顔の位置合わせとポーズ推定のためのディープアクティブシェイプモデル

Active Shape Model（ASM）は、ターゲット構造を表すオブジェクト形状の統計モデルです。 ASMは、機械学習アルゴリズムをガイドして、オブジェクト（顔など）を表すポイントのセットを画像に適合させることができます。このホワイトペーパーでは、顔の位置合わせと実際の頭のポーズの推定のためにASMによって正規化された損失関数を備えた軽量の畳み込みニューラルネットワーク（CNN）アーキテクチャを紹介します。損失関数のASMベースの正則化項は、ネットワークがより速く学習し、より一般化するように導き、したがって、軽量のネットワークアーキテクチャでも困難な例を処理します。顔のランドマークポイントの検出と顔のポーズの推定を担当する損失関数でマルチタスクを定義します。複数の相関タスクを同時に学習すると、相乗効果が生まれ、個々のタスクのパフォーマンスが向上します。挑戦的なデータセットでの実験結果は、提案されたASM正則化損失関数が、非常に軽量なCNNアーキテクチャを使用して、顔のランドマークポイントの検出とポーズ推定で競争力のあるパフォーマンスを実現することを示しています。

Active Shape Model (ASM) is a statistical model of object shapes that represents a target structure. ASM can guide machine learning algorithms to fit a set of points representing an object (e.g., face) onto an image. This paper presents a lightweight Convolutional Neural Network (CNN) architecture with a loss function regularized by ASM for face alignment and estimating head pose in the wild. The ASM-based regularization term in the loss function would guide the network to learn faster, generalize better, and hence handle challenging examples even with light-weight network architecture. We define multi-tasks in our loss function that are responsible for detecting facial landmark points, as well as estimating face pose. Learning multiple correlated tasks simultaneously builds synergy and improves the performance of individual tasks. Experimental results on challenging datasets show that our proposed ASM regularized loss function achieves competitive performance for facial landmark points detection and pose estimation using a very light-weight CNN architecture.

updated: Sat Feb 27 2021 03:46:54 GMT+0000 (UTC)

published: Sat Feb 27 2021 03:46:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト