Accurate and Efficient Event-based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network

Rui Zhang; Luziwei Leng; Kaiwei Che; Hu Zhang; Jie Cheng; Qinghai Guo; Jiangxing Liao; Ran Cheng

適応スパイキングエンコーダ/デコーダネットワークを使用した正確かつ効率的なイベントベースのセマンティックセグメンテーション

低電力のイベント駆動型計算と固有の時間ダイナミクスを活用するスパイキングニューラルネットワーク (SNN) は、イベントベースのセンサーからの動的信号と非同期信号を処理するための理想的なソリューションとなる可能性があります。ただし、トレーニングにおける課題とアーキテクチャ設計の制限により、人工ニューラルネットワーク (ANN) と比較した場合、イベントベースの高密度予測の領域で競合する SNN の例は限られています。この論文では、大規模なイベントベースのセマンティックセグメンテーションタスク向けに設計された効率的なスパイキングエンコーダ/デコーダネットワークを紹介します。これは、階層検索方法を使用してエンコーダを最適化することで実現されます。動的なイベントストリームからの学習を強化するために、スパイキングニューロンの固有の適応閾値を利用してネットワークの活性化を調整します。さらに、疎なイベントの表現を強化するために特別に設計されたデュアルパススパイキング空間適応変調 (SSAM) ブロックを導入し、それによってネットワークパフォーマンスを大幅に向上させます。私たちが提案するネットワークは、DDD17 データセットで 72.57% の平均和集合 (MIoU) を達成し、最近導入されたより大規模な DSEC-Semantic データセットで 57.22% の MIoU を達成します。このパフォーマンスは、現在の最先端の ANN を 4% 上回っていますが、消費する計算リソースは大幅に少なくなります。私たちの知る限り、これは、要求の厳しいイベントベースのセマンティックセグメンテーションタスクにおいて SNN が ANN よりも優れていることを実証した最初の研究であり、それによってイベントベースのビジョンの分野における SNN の膨大な可能性が確立されました。私たちのソースコードは一般公開されます。

Leveraging the low-power, event-driven computation and the inherent temporal dynamics, spiking neural networks (SNNs) are potentially ideal solutions for processing dynamic and asynchronous signals from event-based sensors. However, due to the challenges in training and the restrictions in architectural design, there are limited examples of competitive SNNs in the realm of event-based dense prediction when compared to artificial neural networks (ANNs). In this paper, we present an efficient spiking encoder-decoder network designed for large-scale event-based semantic segmentation tasks. This is achieved by optimizing the encoder using a hierarchical search method. To enhance learning from dynamic event streams, we harness the inherent adaptive threshold of spiking neurons to modulate network activation. Moreover, we introduce a dual-path Spiking Spatially-Adaptive Modulation (SSAM) block, specifically designed to enhance the representation of sparse events, thereby considerably improving network performance. Our proposed network achieves a 72.57% mean intersection over union (MIoU) on the DDD17 dataset and a 57.22% MIoU on the recently introduced, larger DSEC-Semantic dataset. This performance surpasses the current state-of-the-art ANNs by 4%, whilst consuming significantly less computational resources. To the best of our knowledge, this is the first study demonstrating SNNs outperforming ANNs in demanding event-based semantic segmentation tasks, thereby establishing the vast potential of SNNs in the field of event-based vision. Our source code will be made publicly accessible.

updated: Sun Jul 09 2023 08:30:41 GMT+0000 (UTC)

published: Mon Apr 24 2023 07:12:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト