AMPose: Alternatively Mixed Global-Local Attention Model for 3D Human Pose Estimation

Hongxin Lin; Yunwei Chiu; Peiyuan Wu

Graph convolutional networks have been applied to 3D human pose estimation. In addition, pure transformer models have recently shown promising results in video-based methods. However, single-frame methods still need to model the physically connected relations among joints, because feature representations transformed only by global attention lack the relationships of the human skeleton. We propose a novel architecture that combines the physically connected and global relations among human joints. We evaluate our method on Human3.6M and compare it with state-of-the-art models. Our model achieves superior results over all other models. It also shows better generalization ability in a cross-dataset comparison on MPI-INF-3DHP.
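
To make the idea of mixing global and physically connected relations concrete, the following is a minimal NumPy sketch, not the authors' implementation. It alternates a single-head self-attention step (every joint attends to every joint) with a graph-convolution step restricted to skeletal neighbours. The 5-joint chain skeleton, the feature width, the random weights, the residual connections, and all function names are illustrative assumptions that do not come from the abstract.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def global_attention(x, wq, wk, wv):
    """Single-head self-attention: every joint attends to every other joint."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return x + softmax(scores) @ v  # residual connection (assumption)

def normalized_adjacency(num_joints, edges):
    """Symmetrically normalized adjacency with self-loops: D^-1/2 (A + I) D^-1/2."""
    a = np.eye(num_joints)
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    d = a.sum(axis=1)
    return a / np.sqrt(np.outer(d, d))

def local_graph_conv(x, a_hat, w):
    """Graph convolution: each joint aggregates only its skeletal neighbours."""
    return x + np.maximum(a_hat @ x @ w, 0.0)  # ReLU plus residual (assumption)

# Hypothetical toy skeleton: a 5-joint chain, not the Human3.6M joint layout.
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
num_joints, dim = 5, 8

rng = np.random.default_rng(0)
x = rng.normal(size=(num_joints, dim))  # stand-in for per-joint features
a_hat = normalized_adjacency(num_joints, edges)

# Alternate global and local blocks; weights are random, untrained placeholders.
for _ in range(2):
    wq, wk, wv, wg = (rng.normal(scale=0.1, size=(dim, dim)) for _ in range(4))
    x = global_attention(x, wq, wk, wv)
    x = local_graph_conv(x, a_hat, wg)

print(x.shape)  # (5, 8)
```

In this sketch the local step can only pass information along skeleton edges, while the global step lets distant joints interact directly; alternating the two is one simple way to expose both kinds of relation to the features.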

updated: Sun Oct 09 2022 10:10:13 GMT+0000 (UTC)

published: Sun Oct 09 2022 10:10:13 GMT+0000 (UTC)

arXiv
