Peter Hardy; Srinandan Dasmahapatra; Hansung Kim

「独立したパーツを個別に教える」（TIPSy-GAN）：教師なし敵対2Dから3Dポーズ推定における精度と安定性の改善

"Teaching Independent Parts Separately" (TIPSy-GAN) : Improving Accuracy and Stability in Unsupervised Adversarial 2D to 3D Pose Estimation

TIPSy-GANを紹介します。これは、教師なしの敵対的2Dから3Dの人間の姿勢推定における精度と安定性を向上させるための新しいアプローチです。私たちの仕事では、人間の運動学的骨格を単一の空間的に共依存する構造として想定すべきではないことを示しています。実際、トレーニング中に完全な2Dポーズが提供されると、キーポイントの3D座標が他のすべてのキーポイントの2D座標に空間的に共依存するという固有のバイアスが学習されると考えられます。仮説を調査するために、以前の敵対的なアプローチに従いますが、運動学的骨格の空間的に独立した部分である胴体と脚で2つのジェネレーターをトレーニングします。自己整合性サイクルを改善することが評価エラーを下げるための鍵であり、したがってトレーニング中に新しい整合性制約を導入することがわかります。 TIPSyモデルは、これらのジェネレーターからの知識蒸留によって生成され、2Dポーズ全体の3D縦座標を予測して、結果を改善できます。さらに、真に監視されていないシナリオでトレーニングする期間についての以前の作業で、未回答の質問に対処します。 2つの独立した発電機について、敵対的に訓練することで、崩壊する単独の発電機よりも安定性が向上することを示します。 TIPSyは、Human3.6Mデータセットのベースラインソロジェネレーターと比較して、平均エラーを17％削減します。 TIPSyは、Human3.6MデータセットとMPI-INF-3DHPデータセットの両方での評価中に、他の教師なしアプローチを改善すると同時に、監視されたアプローチと弱く監視されたアプローチに対して強力に機能します。

We present TIPSy-GAN, a new approach to improve the accuracy and stability in unsupervised adversarial 2D to 3D human pose estimation. In our work we demonstrate that the human kinematic skeleton should not be assumed as a single spatially codependent structure; in fact, we posit when a full 2D pose is provided during training, there is an inherent bias learned where the 3D coordinate of a keypoint is spatially codependent on the 2D coordinates of all other keypoints. To investigate our hypothesis we follow previous adversarial approaches but train two generators on spatially independent parts of the kinematic skeleton, the torso and the legs. We find that improving the self-consistency cycle is key to lowering the evaluation error and therefore introduce new consistency constraints during training. A TIPSy model is produced via knowledge distillation from these generators which can predict the 3D ordinates for the entire 2D pose with improved results. Furthermore, we address an unanswered question in prior work of how long to train in a truly unsupervised scenario. We show that for two independent generators training adversarially has improved stability than that of a solo generator which collapses. TIPSy decreases the average error by 17% when compared to that of a baseline solo generator on the Human3.6M dataset. TIPSy improves upon other unsupervised approaches while also performing strongly against supervised and weakly-supervised approaches during evaluation on both the Human3.6M and MPI-INF-3DHP datasets.

updated: Fri May 27 2022 15:11:36 GMT+0000 (UTC)

published: Thu May 12 2022 09:40:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト