A Voxel Graph CNN for Object Classification with Event Cameras

Yongjian Deng; Hao Chen; Hai Liu; Youfu Li

イベントカメラによるオブジェクト分類のためのボクセルグラフCNN

イベントカメラは、消費電力が少なく、ダイナミックレンジが高く、時間分解能が非常に高いため、研究者の注目を集めています。イベントベースのオブジェクト分類に関する学習モデルは、最近、スパースイベントを高密度フレームに蓄積して従来の2D学習方法を適用することにより、大成功を収めました。しかし、これらのアプローチは重いモデルを必要とし、スパースからデンスへの変換によって導入される冗長な情報のために計算が非常に複雑になり、実際のアプリケーションでのイベントカメラの可能性が制限されます。この研究は、イベントベースの分類モデルの精度とモデルの複雑さのバランスを取るという主要な問題に対処することを目的としています。この目的のために、イベントデータの新しいグラフ表現を導入して、そのスパース性をより有効に活用し、イベントベースの分類のために軽量ボクセルグラフ畳み込みニューラルネットワーク（EV-VGCNN）をカスタマイズします。具体的には、（1）以前のポイント単位の入力ではなくボクセル単位の頂点を使用して、スパース性を維持しながらイベントストリームの地域の2Dセマンティクスを明示的に活用します。（2）空間と動きを抽出するために、マルチスケールフィーチャリレーショナルレイヤー（MFRL）を提案します。隣接する頂点までの距離に関して、各頂点から識別的に手がかりを与えます。包括的な実験は、私たちのモデルが非常に低いモデルの複雑さ（わずか0.84Mのパラメーター）で最先端の分類精度を向上させることができることを示しています。

Event cameras attract researchers' attention due to their low power consumption, high dynamic range, and extremely high temporal resolution. Learning models on event-based object classification have recently achieved massive success by accumulating sparse events into dense frames to apply traditional 2D learning methods. Yet, these approaches necessitate heavy-weight models and are with high computational complexity due to the redundant information introduced by the sparse-to-dense conversion, limiting the potential of event cameras on real-life applications. This study aims to address the core problem of balancing accuracy and model complexity for event-based classification models. To this end, we introduce a novel graph representation for event data to exploit their sparsity better and customize a lightweight voxel graph convolutional neural network (EV-VGCNN) for event-based classification. Specifically, (1) using voxel-wise vertices rather than previous point-wise inputs to explicitly exploit regional 2D semantics of event streams while keeping the sparsity;(2) proposing a multi-scale feature relational layer (MFRL) to extract spatial and motion cues from each vertex discriminatively concerning its distances to neighbors. Comprehensive experiments show that our model can advance state-of-the-art classification accuracy with extremely low model complexity (merely 0.84M parameters).

updated: Fri Apr 08 2022 05:06:06 GMT+0000 (UTC)

published: Tue Jun 01 2021 04:07:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト