Hierarchical growing grid networks for skeleton based action recognition

Zahra Gharaee

スケルトンベースのアクション認識のための階層的に成長するグリッドネットワーク

本論文では、成長するグリッドニューラルネットワークの層を適用することにより、行動認識のための新しい認知アーキテクチャを開発し、これらの層を使用することで、システムはその表現構造を自動的に配置できるようになります。成長段階でのニューラルマップの拡張に加えて、システムには入力空間の事前知識が提供され、学習段階の処理速度が向上します。成長するグリッドネットワークの2つの層とは別に、アーキテクチャは前処理層、順序付けられたベクトル表現層、および1層の教師ありニューラルネットワークで構成されます。これらのレイヤーは、アクション認識の問題を解決するように設計されています。第1層の成長グリッドは、人間の行動の入力データを受け取り、ニューラルマップは、トレーニングされたマップの誘発されたアクティブ化を接続することによって、各アクションシーケンスを表すアクションパターンベクトルを生成します。次に、パターンベクトルが順序付きベクトル表現層に送信され、第2層の成長グリッドのキーアクティベーションの時不変入力ベクトルが構築されます。第2層の成長グリッドは、入力ベクトルを対応するアクションクラスター/サブクラスターに分類し、最後に、1層の教師ありニューラルネットワークが成形クラスターにアクションラベルを付けます。アクションの異なるデータセットを使用した3つの実験は、システムがアクションを迅速かつ効率的に分類することを学習できることを示しています。成長するグリッドアーキテクチャのパフォーマンスは、自己組織化マップに基づくシステムの結果と比較され、成長するグリッドアーキテクチャがアクション認識タスクで非常に優れたパフォーマンスを発揮することを示しています。

In this paper, a novel cognitive architecture for action recognition is developed by applying layers of growing grid neural networks.Using these layers makes the system capable of automatically arranging its representational structure. In addition to the expansion of the neural map during the growth phase, the system is provided with a prior knowledge of the input space, which increases the processing speed of the learning phase. Apart from two layers of growing grid networks the architecture is composed of a preprocessing layer, an ordered vector representation layer and a one-layer supervised neural network. These layers are designed to solve the action recognition problem. The first-layer growing grid receives the input data of human actions and the neural map generates an action pattern vector representing each action sequence by connecting the elicited activation of the trained map. The pattern vectors are then sent to the ordered vector representation layer to build the time-invariant input vectors of key activations for the second-layer growing grid. The second-layer growing grid categorizes the input vectors to the corresponding action clusters/sub-clusters and finally the one-layer supervised neural network labels the shaped clusters with action labels. Three experiments using different datasets of actions show that the system is capable of learning to categorize the actions quickly and efficiently. The performance of the growing grid architecture is com-pared with the results from a system based on Self-Organizing Maps, showing that the growing grid architecture performs significantly superior on the action recognition tasks.

updated: Thu Apr 22 2021 16:35:32 GMT+0000 (UTC)

published: Thu Apr 22 2021 16:35:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト