3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition

Pierre-Etienne Martin; J Benois-Pineau; R Péteri; A Zemmari; J Morlier

アクション認識のための3D畳み込みネットワーク：スポーツジェスチャ認識への応用

3D畳み込みネットワークは、ビデオのコヒーレントな時空間チャンクへのセグメンテーションや、ターゲット分類法に関するそれらの分類などのタスクを実行するための優れた手段です。この章では、卓球のストロークなど、繰り返し可能なアクションを伴う連続ビデオテイクの分類に関心があります。エコロジーの少ない無料のマーカーで撮影されたこれらのビデオは、セグメンテーションと分類の両方の観点からの課題を表しています。 3D convnetsは、ウィンドウベースのアプローチでこれらの問題を解決するための効率的なツールです。

3D convolutional networks is a good means to perform tasks such as video segmentation into coherent spatio-temporal chunks and classification of them with regard to a target taxonomy. In the chapter we are interested in the classification of continuous video takes with repeatable actions, such as strokes of table tennis. Filmed in a free marker less ecological environment, these videos represent a challenge from both segmentation and classification point of view. The 3D convnets are an efficient tool for solving these problems with window-based approaches.

updated: Wed Apr 13 2022 13:21:07 GMT+0000 (UTC)

published: Wed Apr 13 2022 13:21:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト