SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera

Denis Tome; Thiemo Alldieck; Patrick Peluse; Gerard Pons-Moll; Lourdes Agapito; Hernan Badino; Fernando De la Torre

SelfPose：ヘッドセットに取り付けられたカメラからの3D自己中心的なポーズ推定

ヘッドマウントVRデバイスのリムに取り付けられた下向きの魚眼カメラからキャプチャされた単眼画像からの自己中心的な3D体のポーズ推定のソリューションを提示します。この珍しい視点は、下半身と上半身の解像度に劇的な違いをもたらす深刻な自己閉塞と遠近法の歪みを伴う、独特の視覚的外観の画像につながります。 2D予測の不確実性の変化を考慮して設計された、新しいマルチブランチデコーダーを備えたエンコーダーデコーダーアーキテクチャを提案します。合成データセットと実世界のデータセットでの定量的評価は、私たちの戦略が最先端の自己中心的アプローチよりも精度の大幅な向上につながることを示しています。ラベル付けされたデータの不足に対処するために、大規模なフォトリアリスティックな合成データセットも導入しました。 xR-EgoPoseは、さまざまなスキントーン、体型、衣服を備えた人々の高品質なレンダリングを提供し、さまざまなアクションを実行します。私たちの実験は、新しい合成トレーニングコーパスの高い変動性が、実世界の映像への優れた一般化と、グラウンドトゥルースを使用した実世界のデータセットでの最先端の結果につながることを示しています。さらに、Human3.6Mベンチマークでの評価は、私たちの方法のパフォーマンスが、第三者の視点から見た3D人間のポーズのより古典的な問題に対する最高のパフォーマンスのアプローチと同等であることを示しています。

We present a solution to egocentric 3D body pose estimation from monocular images captured from downward looking fish-eye cameras installed on the rim of a head mounted VR device. This unusual viewpoint leads to images with unique visual appearance, with severe self-occlusions and perspective distortions that result in drastic differences in resolution between lower and upper body. We propose an encoder-decoder architecture with a novel multi-branch decoder designed to account for the varying uncertainty in 2D predictions. The quantitative evaluation, on synthetic and real-world datasets, shows that our strategy leads to substantial improvements in accuracy over state of the art egocentric approaches. To tackle the lack of labelled data we also introduced a large photo-realistic synthetic dataset. xR-EgoPose offers high quality renderings of people with diverse skintones, body shapes and clothing, performing a range of actions. Our experiments show that the high variability in our new synthetic training corpus leads to good generalization to real world footage and to state of theart results on real world datasets with ground truth. Moreover, an evaluation on the Human3.6M benchmark shows that the performance of our method is on par with top performing approaches on the more classic problem of 3D human pose from a third person viewpoint.

updated: Mon Nov 02 2020 16:18:06 GMT+0000 (UTC)

published: Mon Nov 02 2020 16:18:06 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト