Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation

Jiadai Sun; Yuchao Dai; Xianjing Zhang; Jintao Xu; Rui Ai; Weihao Gu; Xieyuanli Chen

LiDARベースの3D移動オブジェクトセグメンテーションのための効率的な時空間情報融合

正確な移動物体のセグメンテーションは、自動運転にとって不可欠なタスクです。衝突回避、経路計画、静的マップの構築など、多くのダウンストリームタスクに効果的な情報を提供できます。時空間情報を効果的に活用する方法は、3D LiDAR移動オブジェクトセグメンテーション（LiDAR-MOS）にとって重要な問題です。この作業では、LiDAR-MOSのパフォーマンスを向上させるために、LiDARスキャンの時空間情報とさまざまな表現モダリティの両方を活用する新しいディープニューラルネットワークを提案します。具体的には、最初に距離画像ベースのデュアルブランチ構造を使用して、順次LiDARスキャンから取得できる空間情報と時間情報を個別に処理し、後でモーションガイド注意モジュールを使用してそれらを組み合わせます。また、3Dスパース畳み込みを介したポイントリファインメントモジュールを使用して、LiDAR範囲画像とポイントクラウド表現の両方からの情報を融合し、オブジェクトの境界にあるアーティファクトを減らします。 SemanticKITTIのLiDAR-MOSベンチマークで提案されたアプローチの有効性を検証します。私たちの方法は、LiDAR-MOS IoUの点で、最先端の方法を大幅に上回っています。考案された粗いものから細かいものへのアーキテクチャの恩恵を受けて、私たちの方法はセンサーフレームレートでオンラインで動作します。このメソッドの実装は、https：//github.com/haomo-ai/MotionSeg3Dでオープンソースとして入手できます。

Accurate moving object segmentation is an essential task for autonomous driving. It can provide effective information for many downstream tasks, such as collision avoidance, path planning, and static map construction. How to effectively exploit the spatial-temporal information is a critical question for 3D LiDAR moving object segmentation (LiDAR-MOS). In this work, we propose a novel deep neural network exploiting both spatial-temporal information and different representation modalities of LiDAR scans to improve LiDAR-MOS performance. Specifically, we first use a range image-based dual-branch structure to separately deal with spatial and temporal information that can be obtained from sequential LiDAR scans, and later combine them using motion-guided attention modules. We also use a point refinement module via 3D sparse convolution to fuse the information from both LiDAR range image and point cloud representations and reduce the artifacts on the borders of the objects. We verify the effectiveness of our proposed approach on the LiDAR-MOS benchmark of SemanticKITTI. Our method outperforms the state-of-the-art methods significantly in terms of LiDAR-MOS IoU. Benefiting from the devised coarse-to-fine architecture, our method operates online at sensor frame rate. The implementation of our method is available as open source at: https://github.com/haomo-ai/MotionSeg3D.

updated: Tue Jul 05 2022 17:59:17 GMT+0000 (UTC)

published: Tue Jul 05 2022 17:59:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト