EfficientPose: Scalable single-person pose estimation

Daniel Groos; Heri Ramampiaro; Espen A. F. Ihlen

EfficientPose：スケーラブルな一人のポーズの推定

一人の人間の姿勢推定は、スポーツや臨床応用におけるマーカーのない動きの分析を容易にします。それでも、人間の姿勢を推定するための最先端のモデルは、通常、実際のアプリケーションの要件を満たしていません。ディープラーニング技術の急増により、多くの高度なアプローチが開発されました。ただし、この分野での進歩に伴い、より複雑で非効率的なモデルも導入されており、計算要求が大幅に増加しています。これらの複雑さと非効率性の課題に対処するために、EfficientPoseと呼ばれる新しい畳み込みニューラルネットワークアーキテクチャを提案します。これは、最近提案されたEfficientNetを活用して、効率的でスケーラブルな1人のポーズ推定を提供します。 EfficientPoseは、効果的なマルチスケール特徴抽出器と、モバイル逆ボトルネック畳み込みを使用した計算効率の高い検出ブロックを利用するモデルのファミリーであると同時に、ポーズ構成の精度がさらに向上することを保証します。複雑さと効率が低いため、EfficientPoseは、メモリフットプリントと計算コストを制限することにより、エッジデバイスでの実際のアプリケーションを可能にします。挑戦的なMPII一人のベンチマークを使用した実験の結果は、提案されたEfficientPoseモデルが、精度と計算効率の両方の点で、広く使用されているOpenPoseモデルを大幅に上回っていることを示しています。特に、当社の最高性能モデルは、複雑性の低いConvNetを使用して、1人のMPIIで最先端の精度を実現します。

Single-person human pose estimation facilitates markerless movement analysis in sports, as well as in clinical applications. Still, state-of-the-art models for human pose estimation generally do not meet the requirements of real-life applications. The proliferation of deep learning techniques has resulted in the development of many advanced approaches. However, with the progresses in the field, more complex and inefficient models have also been introduced, which have caused tremendous increases in computational demands. To cope with these complexity and inefficiency challenges, we propose a novel convolutional neural network architecture, called EfficientPose, which exploits recently proposed EfficientNets in order to deliver efficient and scalable single-person pose estimation. EfficientPose is a family of models harnessing an effective multi-scale feature extractor and computationally efficient detection blocks using mobile inverted bottleneck convolutions, while at the same time ensuring that the precision of the pose configurations is still improved. Due to its low complexity and efficiency, EfficientPose enables real-world applications on edge devices by limiting the memory footprint and computational cost. The results from our experiments, using the challenging MPII single-person benchmark, show that the proposed EfficientPose models substantially outperform the widely-used OpenPose model both in terms of accuracy and computational efficiency. In particular, our top-performing model achieves state-of-the-art accuracy on single-person MPII, with low-complexity ConvNets.

updated: Fri Dec 04 2020 09:27:44 GMT+0000 (UTC)

published: Sat Apr 25 2020 16:50:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト