Hierarchical Graph Networks for 3D Human Pose Estimation

Han Li; Bowen Shi; Wenrui Dai; Yabo Chen; Botao Wang; Yu Sun; Min Guo; Chenlin Li; Junni Zou; Hongkai Xiong

3D人間の姿勢推定のための階層グラフネットワーク

最近の2Dから3Dへの人間の姿勢推定作業は、人間の骨格のトポロジーによって形成されたグラフ構造を利用する傾向があります。ただし、この骨格トポロジーは、体の構造を反映するにはまばらすぎて、2Dから3Dへの深刻なあいまいさの問題に悩まされていると主張します。これらの弱点を克服するために、新しいグラフ畳み込みネットワークアーキテクチャであるHierarchical Graph Networks（HGN）を提案します。これは、マルチスケールグラフ構造構築戦略によって生成されたより高密度のグラフトポロジに基づいているため、より繊細な幾何学的情報を提供します。提案されたアーキテクチャには、並列に編成された3つのスパースからファインまでの表現サブネットワークが含まれています。このサブネットワークでは、マルチスケールのグラフ構造の特徴が処理され、新しい特徴融合戦略を通じて情報が交換され、豊富な階層表現が実現します。また、詳細関連の特徴学習をさらに強化するために、3D粗メッシュ制約を導入します。広範な実験により、HGNがネットワークパラメータを削減して最先端のパフォーマンスを実現することが実証されています

Recent 2D-to-3D human pose estimation works tend to utilize the graph structure formed by the topology of the human skeleton. However, we argue that this skeletal topology is too sparse to reflect the body structure and suffer from serious 2D-to-3D ambiguity problem. To overcome these weaknesses, we propose a novel graph convolution network architecture, Hierarchical Graph Networks (HGN). It is based on denser graph topology generated by our multi-scale graph structure building strategy, thus providing more delicate geometric information. The proposed architecture contains three sparse-to-fine representation subnetworks organized in parallel, in which multi-scale graph-structured features are processed and exchange information through a novel feature fusion strategy, leading to rich hierarchical representations. We also introduce a 3D coarse mesh constraint to further boost detail-related feature learning. Extensive experiments demonstrate that our HGN achieves the state-of-the art performance with reduced network parameters

updated: Tue Nov 23 2021 15:09:03 GMT+0000 (UTC)

published: Tue Nov 23 2021 15:09:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト