Human Action Recognition and Prediction: A Survey

Yu Kong; Yun Fu

Driven by rapid advances in computer vision and machine learning, video analysis tasks have been moving from inferring the present state to predicting the future state. Vision-based action recognition and prediction from videos are two such tasks: action recognition infers human actions (present state) from complete action executions, while action prediction predicts human actions (future state) from incomplete action executions. Both tasks have recently become particularly active topics because of their rapidly emerging real-world applications, such as visual surveillance, autonomous driving, entertainment, and video retrieval. Many efforts have been devoted over the last few decades to building robust and effective frameworks for action recognition and prediction. In this paper, we survey the state-of-the-art techniques in action recognition and prediction. We also systematically discuss existing models, popular algorithms, technical difficulties, popular action databases, evaluation protocols, and promising future directions.

updated: Sun Feb 13 2022 04:11:52 GMT+0000 (UTC)

published: Thu Jun 28 2018 23:43:42 GMT+0000 (UTC)

arXiv
