PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction

Hongsong Wang; Jian Dong; Bin Cheng; Jiashi Feng

PVRED：人間の動きを予測するための位置速度再帰エンコーダーデコーダー

過去のポーズを考慮して将来の人間のポーズを予測することを目的とした人間の動きの予測は、最近関心が高まっている。最近の多くのアプローチは、指数マップを使用して人間のポーズをモデル化するリカレントニューラルネットワーク（RNN）に基づいています。これらのアプローチは、ポーズの速度とさまざまなポーズの時間的関係を無視し、平均ポーズに収束するか、自然に見えるポーズを生成できない傾向があります。したがって、我々は、姿勢速度と時間的位置情報を最大限に活用する、人間の動きを予測するための新しい位置速度再帰エンコーダデコーダ（PVRED）を提案します。時間的位置埋め込み法が提示され、位置速度RNN（PVRNN）が提案されます。また、ポーズのクォータニオンパラメータ化の利点を強調し、トレーニング中の堅牢な損失関数と組み合わされた、新しいトレーニング可能なクォータニオン変換（QT）レイヤーを設計します。 0.5秒先の短期予測と0.5〜1秒先の長期予測の両方について定量的な結果を提供します。いくつかのベンチマークでの実験は、私たちのアプローチが最先端の方法を大幅に上回っていることを示しています。さらに、将来の4秒間の定性的な視覚化は、私たちのアプローチが非常に長い期間で将来の人間のような意味のあるポーズを予測できることを示しています。コードはGitHubで公開されています：redhttps：//github.com/hongsong-wang/PVRNN。

Human motion prediction, which aims to predict future human poses given past poses, has recently seen increased interest. Many recent approaches are based on Recurrent Neural Networks (RNN) which model human poses with exponential maps. These approaches neglect the pose velocity as well as temporal relation of different poses, and tend to converge to the mean pose or fail to generate natural-looking poses. We therefore propose a novel Position-Velocity Recurrent Encoder-Decoder (PVRED) for human motion prediction, which makes full use of pose velocities and temporal positional information. A temporal position embedding method is presented and a Position-Velocity RNN (PVRNN) is proposed. We also emphasize the benefits of quaternion parameterization of poses and design a novel trainable Quaternion Transformation (QT) layer, which is combined with a robust loss function during training. We provide quantitative results for both short-term prediction in the future 0.5 seconds and long-term prediction in the future 0.5 to 1 seconds. Experiments on several benchmarks show that our approach considerably outperforms the state-of-the-art methods. In addition, qualitative visualizations in the future 4 seconds show that our approach could predict future human-like and meaningful poses in very long time horizons. Code is publicly available on GitHub: redhttps://github.com/hongsong-wang/PVRNN.

updated: Sun Jun 13 2021 01:42:05 GMT+0000 (UTC)

published: Sat Jun 15 2019 09:59:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト