Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior

Chenguang Li; Jia Shi; Ya Wang; Guangliang Cheng

BEV からの再構築: 事前のジオメトリ構造に基づく 3D 車線検出アプローチ

この論文では、2D から 3D レーン再構築のプロセスの下にあるジオメトリ構造を活用することにより、単眼 3D レーン検出の問題を対象とする高度なアプローチを提案します。以前の方法に着想を得て、最初に 3D 車線とその地上の 2D 表現の間のジオメトリヒューリスティックを分析し、構造の事前に基づいて明示的な監視を課すことを提案します。これにより、車線間および車線内の関係を構築して容易にローカルからグローバルへの 3D レーンの再構築。次に、2D レーン表現における構造の損失を減らすために、正面図の画像から BEV レーン情報を直接抽出します。さらに、パイプラインのセグメンテーションタスクと再構築タスクの両方に新しいトレーニングデータを合成することにより、新しいタスク固有のデータ拡張方法を提案し、カメラの姿勢と地面の傾斜の不均衡なデータ分布に対処して、目に見えないデータの一般化を改善します。私たちの研究は、ジオメトリの事前情報を DNN ベースの 3D 車線検出に採用する最初の試みであり、非常に長い距離の車線を検出できるようにし、元の検出範囲を 2 倍にします。提案された方法は、追加コストなしで他のフレームワークでスムーズに採用できます。実験結果は、追加のパラメーターを導入することなく、82 FPS のリアルタイム速度で Apollo 3D 合成データセットで最先端のアプローチを 3.8% F スコアで上回ることを示しています。

In this paper, we propose an advanced approach in targeting the problem of monocular 3D lane detection by leveraging geometry structure underneath the process of 2D to 3D lane reconstruction. Inspired by previous methods, we first analyze the geometry heuristic between the 3D lane and its 2D representation on the ground and propose to impose explicit supervision based on the structure prior, which makes it achievable to build inter-lane and intra-lane relationships to facilitate the reconstruction of 3D lanes from local to global. Second, to reduce the structure loss in 2D lane representation, we directly extract BEV lane information from front view images, which tremendously eases the confusion of distant lane features in previous methods. Furthermore, we propose a novel task-specific data augmentation method by synthesizing new training data for both segmentation and reconstruction tasks in our pipeline, to counter the imbalanced data distribution of camera pose and ground slope to improve generalization on unseen data. Our work marks the first attempt to employ the geometry prior information into DNN-based 3D lane detection and makes it achievable for detecting lanes in an extra-long distance, doubling the original detection range. The proposed method can be smoothly adopted by other frameworks without extra costs. Experimental results show that our work outperforms state-of-the-art approaches by 3.8% F-Score on Apollo 3D synthetic dataset at real-time speed of 82 FPS without introducing extra parameters.

updated: Mon Nov 07 2022 23:04:16 GMT+0000 (UTC)

published: Tue Jun 21 2022 04:03:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト