LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation

Song Wang; Wentong Li; Wenyu Liu; Xiaolu Liu; Jianke Zhu

LiDAR2Map: オンラインカメラ蒸留を使用した LiDAR ベースのセマンティックマップ構築の擁護

鳥瞰図 (BEV) の下でのセマンティックマップの構築は、自動運転において重要な役割を果たします。カメラ画像とは対照的に、LiDAR は正確な 3D 観察を提供し、キャプチャした 3D 特徴を本質的に BEV 空間に投影します。ただし、バニラの LiDAR ベースの BEV 機能には多くの不定のノイズが含まれることが多く、空間特徴にはテクスチャやセマンティックな手がかりがほとんどありません。この論文では、セマンティックマップを構築するための効果的な LiDAR ベースの方法を提案します。具体的には、セマンティックマップ構築のために堅牢なマルチスケール BEV 特徴を学習する BEV 特徴ピラミッドデコーダーを導入し、LiDAR ベースの手法の精度を大幅に向上させます。 LiDAR データのセマンティックキューの欠如によって引き起こされる欠陥を軽減するために、画像から点群へのセマンティック学習を促進するオンラインの Camera-to-LiDAR 蒸留スキームを紹介します。私たちの蒸留スキームは、BEV のカメラからセマンティック情報を吸収するための特徴レベルとロジットレベルの蒸留で構成されています。挑戦的な nuScenes データセットに関する実験結果は、セマンティックマップ構築における私たちが提案する LiDAR2Map の有効性を実証しています。これは、以前の LiDAR ベースの手法を 27.9% mIoU 以上大幅に上回り、最先端のカメラベースのアプローチよりも優れたパフォーマンスを発揮します。ソースコードはhttps://github.com/songw-zju/LiDAR2Mapから入手できます。

Semantic map construction under bird's-eye view (BEV) plays an essential role in autonomous driving. In contrast to camera image, LiDAR provides the accurate 3D observations to project the captured 3D features onto BEV space inherently. However, the vanilla LiDAR-based BEV feature often contains many indefinite noises, where the spatial features have little texture and semantic cues. In this paper, we propose an effective LiDAR-based method to build semantic map. Specifically, we introduce a BEV feature pyramid decoder that learns the robust multi-scale BEV features for semantic map construction, which greatly boosts the accuracy of the LiDAR-based method. To mitigate the defects caused by lacking semantic cues in LiDAR data, we present an online Camera-to-LiDAR distillation scheme to facilitate the semantic learning from image to point cloud. Our distillation scheme consists of feature-level and logit-level distillation to absorb the semantic information from camera in BEV. The experimental results on challenging nuScenes dataset demonstrate the efficacy of our proposed LiDAR2Map on semantic map construction, which significantly outperforms the previous LiDAR-based methods over 27.9% mIoU and even performs better than the state-of-the-art camera-based approaches. Source code is available at: https://github.com/songw-zju/LiDAR2Map.

updated: Mon Jun 05 2023 03:56:19 GMT+0000 (UTC)

published: Sat Apr 22 2023 12:05:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト