The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture Recognition

Zihao Zhao; Yanhong Wang; Qiaosha Zou; Tie Xu; Fangbo Tao; Jiansong Zhang; Xiaoan Wang; C. -J. Richard Shi; Junwen Luo; Yuan Xie

Action recognition is an exciting research avenue for artificial intelligence because it may be a game changer in emerging industrial fields such as robotic vision and automobiles. However, current deep learning faces major challenges in such applications because of its huge computational cost and inefficient learning. Hence, we develop a novel brain-inspired Spiking Neural Network (SNN) based system, titled Spiking Gating Flow (SGF), for online action learning. The developed system consists of multiple SGF units that are assembled in a hierarchical manner. A single SGF unit involves three layers: a feature extraction layer, an event-driven layer, and a histogram-based training layer. To demonstrate the capabilities of the developed system, we employ a standard Dynamic Vision Sensor (DVS) gesture classification task as a benchmark. The results indicate that we can achieve 87.5% accuracy, which is comparable with Deep Learning (DL), but at a smaller training/inference data ratio of 1.5:1. Moreover, only a single training epoch is required during the learning process. Meanwhile, to the best of our knowledge, this is the highest accuracy among non-backpropagation-based SNNs. Finally, we summarize the few-shot learning paradigm of the developed network: 1) a hierarchical structure-based network design that involves human prior knowledge; 2) SNNs for content-based global dynamic feature detection.

updated: Sat Jun 04 2022 04:37:56 GMT+0000 (UTC)

published: Sat Jun 04 2022 04:37:56 GMT+0000 (UTC)

arXiv
