MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision

Ben Usman; Andrea Tagliasacchi; Kate Saenko; Avneesh Sud

MetaPose：3D監視なしの複数のビューからの高速3Dポーズ

ディープラーニングの時代では、キャリブレーションが不明な複数のカメラからの人間の姿勢推定は、これまでほとんど注目されていませんでした。このタスクを高精度で最小の遅延オーバーヘッドで実行するようにニューラルモデルをトレーニングする方法を示します。提案されたモデルは、複数のビューからのオクルージョンによる関節の位置の不確実性を考慮に入れており、トレーニングに必要なのは2Dキーポイントデータのみです。私たちの方法は、定評のあるHuman3.6Mデータセット、およびより挑戦的な野生のスキーポーズPTZデータセットで、従来のバンドル調整と弱く監視された単眼3Dベースラインの両方を上回ります。

In the era of deep learning, human pose estimation from multiple cameras with unknown calibration has received little attention to date. We show how to train a neural model to perform this task with high precision and minimal latency overhead. The proposed model takes into account joint location uncertainty due to occlusion from multiple views, and requires only 2D keypoint data for training. Our method outperforms both classical bundle adjustment and weakly-supervised monocular 3D baselines on the well-established Human3.6M dataset, as well as the more challenging in-the-wild Ski-Pose PTZ dataset.

updated: Thu Nov 25 2021 23:16:01 GMT+0000 (UTC)

published: Tue Aug 10 2021 18:39:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト