MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision

Ben Usman; Andrea Tagliasacchi; Kate Saenko; Avneesh Sud

MetaPose：3D監視なしの複数のビューからの高速3Dポーズ

最近、既知のカメラパラメータを使用した単眼およびマルチビューのポーズ推定が大幅に進歩しましたが、位置と方向が不明な複数のカメラからのポーズ推定はあまり注目されていませんでした。この論文では、正確な3Dポーズとカメラ推定を実行でき、複数のビューからのオクルージョンによる関節位置の不確実性を考慮し、トレーニングに2Dキーポイントデータのみを必要とするニューラルモデルをトレーニングする方法を示します。私たちの方法は、定評のあるHuman3.6Mデータセット、および移動カメラを備えたより挑戦的な野生のスキーポーズPTZデータセットで、従来のバンドル調整と弱く監視された単眼3Dベースラインの両方を上回ります。カメラモデル、カメラの数、初期化、および画像空間ジョイントのローカリゼーションによるエラーを、モデルによって導入された追加のエラーから分離する広範なアブレーション研究を提供します。

Recently, huge strides were made in monocular and multi-view pose estimation with known camera parameters, whereas pose estimation from multiple cameras with unknown positions and orientations received much less attention. In this paper, we show how to train a neural model that can perform accurate 3D pose and camera estimation, takes into account joint location uncertainty due occlusion from multiple views, and requires only 2D keypoint data for training. Our method outperforms both classical bundle adjustment and weakly-supervised monocular 3D baselines on the well-established Human3.6M dataset, as well as the more challenging in-the-wild Ski-Pose PTZ dataset with moving cameras. We provide an extensive ablation study separating the error due to the camera model, number of cameras, initialization, and image-space joint localization from the additional error introduced by our model.

updated: Tue Aug 10 2021 18:39:56 GMT+0000 (UTC)

published: Tue Aug 10 2021 18:39:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト