A Baseline Framework for Part-level Action Parsing and Action Recognition

Xiaodong Chen; Xinchen Liu; Kun Liu; Wu Liu; Tao Mei

This technical report introduces our second-place solution to the Kinetics-TPS Track on Part-level Action Parsing at the ICCV DeeperAction Workshop 2021. Our entry is mainly based on YOLOF for instance and part detection, HRNet for human pose estimation, and CSN for video-level action recognition and frame-level part state parsing. We describe technical details for the Kinetics-TPS dataset, together with some experimental results. In the competition, we achieved 61.37% mAP on the Kinetics-TPS test set.

updated: Thu Oct 07 2021 12:04:59 GMT+0000 (UTC)

published: Thu Oct 07 2021 12:04:59 GMT+0000 (UTC)

arXiv
