FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

Neelabh Sinha; Michal Balazia; François Bremond

3D gaze estimation predicts a person's line of sight in 3D space. Person-independent models for this task lack precision because of anatomical differences between subjects, whereas person-specific calibrated techniques place strict constraints on scalability. To overcome these issues, we propose a novel technique, Facial Landmark Heatmap Activated Multimodal Gaze Estimation (FLAME), which incorporates eye anatomical information through eye landmark heatmaps to obtain precise gaze estimation without any person-specific calibration. Our evaluation demonstrates competitive performance, with an improvement of about 10% on the benchmark datasets ColumbiaGaze and EYEDIAP. We also conduct an ablation study to validate our method.
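As an illustration of what an eye landmark heatmap can look like, the sketch below renders a single landmark as a 2D Gaussian centered on its pixel coordinates. The abstract does not describe how FLAME builds its heatmaps, so the Gaussian form, the function name `landmark_heatmap`, and the `sigma` parameter are assumptions made for illustration only.

```python
import math


def landmark_heatmap(cx, cy, width, height, sigma=2.0):
    """Render one landmark at pixel (cx, cy) as a height x width Gaussian heatmap.

    Hypothetical helper: the Gaussian shape and sigma are illustrative
    assumptions, not the construction used in the paper.
    """
    return [
        [
            # Value is 1.0 at the landmark and decays with squared distance.
            math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2.0 * sigma ** 2))
            for x in range(width)
        ]
        for y in range(height)
    ]


# Example: an 8x6 heatmap for a landmark at column 3, row 2.
heatmap = landmark_heatmap(cx=3, cy=2, width=8, height=6)
```

In a multimodal setup, one such map per landmark would typically be stacked into channels and fed to the network alongside the eye image; how FLAME fuses the two modalities is not specified in the abstract.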

updated: Sun Oct 10 2021 15:40:15 GMT+0000 (UTC)

published: Sun Oct 10 2021 15:40:15 GMT+0000 (UTC)

arXiv
