Reconstruct from Top View: A 3D Lane Detection Approach based on Geometry Structure Prior

Chenguang Li; Jia Shi; Ya Wang; Guangliang Cheng

In this paper, we propose an advanced approach to monocular 3D lane detection that leverages the geometric structure underlying the 2D-to-3D lane reconstruction process. Inspired by previous methods, we first analyze the geometric heuristic relating a 3D lane to its 2D representation on the ground and propose imposing explicit supervision based on this structure prior, which makes it possible to build inter-lane and intra-lane relationships that facilitate the reconstruction of 3D lanes from local to global. Second, to reduce the structure loss in the 2D lane representation, we directly extract top-view lane information from front-view images, which greatly eases the confusion of distant lane features seen in previous methods. Furthermore, we propose a novel task-specific data augmentation method that synthesizes new training data for both the segmentation and reconstruction tasks in our pipeline, countering the imbalanced data distribution of camera pose and ground slope and improving generalization to unseen data. Our work marks the first attempt to employ geometry prior information in DNN-based 3D lane detection and makes it possible to detect lanes at extra-long distances, doubling the original detection range. The proposed method can be smoothly adopted by other frameworks at no extra cost. Experimental results show that our work outperforms state-of-the-art approaches by 3.8% F-Score on the Apollo 3D synthetic dataset at a real-time speed of 82 FPS without introducing extra parameters.
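The abstract does not spell out the geometric relation between a 3D lane and its 2D representation on the ground. As a rough illustration only, the sketch below assumes the flat-ground projection commonly used in earlier top-view 3D lane work: a camera at height `cam_height` above the origin, with each 3D point projected along its viewing ray onto the plane z = 0. The function names, the coordinate convention, and the sample values are hypothetical and are not taken from the paper.

```python
def project_to_ground(x, y, z, cam_height):
    """Project a 3D lane point onto the flat ground plane (z = 0).

    Assumption: the camera center sits at (0, 0, cam_height) and the point
    is projected along the ray from the camera through (x, y, z).
    """
    if z >= cam_height:
        raise ValueError("point must lie below the camera center")
    scale = cam_height / (cam_height - z)
    return (x * scale, y * scale)


def lift_from_ground(x_bar, y_bar, z, cam_height):
    """Invert the projection: recover (x, y, z) from ground coordinates
    (x_bar, y_bar) once the point's height z is known or predicted."""
    if z >= cam_height:
        raise ValueError("point must lie below the camera center")
    scale = (cam_height - z) / cam_height
    return (x_bar * scale, y_bar * scale, z)


# A point on the ground (z = 0) maps to itself; a raised point appears
# farther away and farther to the side in the top view.
flat = project_to_ground(1.0, 20.0, 0.0, cam_height=1.5)
raised = project_to_ground(1.0, 20.0, 0.5, cam_height=1.5)
```

Under this assumption, a lane on an uphill slope looks stretched and spread out in the top view. That is the kind of structure a geometry prior could exploit, although the supervision the authors actually use may differ.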

updated: Tue Jun 21 2022 04:03:03 GMT+0000 (UTC)

published: Tue Jun 21 2022 04:03:03 GMT+0000 (UTC)

arXiv
