PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark

Li Chen; Chonghao Sima; Yang Li; Zehan Zheng; Jiajie Xu; Xiangwei Geng; Hongyang Li; Conghui He; Jianping Shi; Yu Qiao; Junchi Yan

PersFormer：パースペクティブトランスフォーマーとOpenLaneベンチマークによる3Dレーン検出

最近、多くの自動運転シナリオ（上り坂/下り坂、バンプなど）での不正確な車線レイアウトの問題に対処するために、3D車線検出の方法が提案されました。以前の作業は、正面図と鳥瞰図（BEV）の間の空間変換の単純な設計と、現実的なデータセットの欠如のために、複雑なケースで苦労していました。これらの問題に向けて、PersFormerを紹介します。これは、新しいTransformerベースの空間機能変換モジュールを備えたエンドツーエンドの単眼3Dレーン検出器です。私たちのモデルは、カメラパラメータを参照として、関連する正面図のローカル領域に注目することでBEV機能を生成します。 PersFormerは、統一された2D / 3Dアンカー設計と補助タスクを採用して、2D / 3Dレーンを同時に検出し、機能の一貫性を高め、マルチタスク学習の利点を共有します。さらに、最初の大規模な実世界の3Dレーンデータセットの1つであるOpenLaneをリリースします。これは、高品質のアノテーションとシナリオの多様性を備えています。 OpenLaneには、200,000フレーム、880,000を超えるインスタンスレベルのレーン、14のレーンカテゴリ、およびレーン検出とより産業関連の自動運転方法の開発を促進するシーンタグとクローズドインパスオブジェクトアノテーションが含まれています。 PersFormerは、新しいOpenLaneデータセットおよびApollo 3D Lane Syntheticデータセットの3Dレーン検出タスクで競合するベースラインを大幅に上回り、OpenLaneの2Dタスクの最先端のアルゴリズムと同等であることを示しています。プロジェクトページはhttps://github.com/OpenPerceptionX/PersFormer_3DLaneで利用でき、OpenLaneデータセットはhttps://github.com/OpenPerceptionX/OpenLaneで提供されています。

Methods for 3D lane detection have been recently proposed to address the issue of inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.). Previous work struggled in complex cases due to their simple designs of the spatial transformation between front view and bird's eye view (BEV) and the lack of a realistic dataset. Towards these issues, we present PersFormer: an end-to-end monocular 3D lane detector with a novel Transformer-based spatial feature transformation module. Our model generates BEV features by attending to related front-view local regions with camera parameters as a reference. PersFormer adopts a unified 2D/3D anchor design and an auxiliary task to detect 2D/3D lanes simultaneously, enhancing the feature consistency and sharing the benefits of multi-task learning. Moreover, we release one of the first large-scale real-world 3D lane datasets: OpenLane, with high-quality annotation and scenario diversity. OpenLane contains 200,000 frames, over 880,000 instance-level lanes, 14 lane categories, along with scene tags and the closed-in-path object annotations to encourage the development of lane detection and more industrial-related autonomous driving methods. We show that PersFormer significantly outperforms competitive baselines in the 3D lane detection task on our new OpenLane dataset as well as Apollo 3D Lane Synthetic dataset, and is also on par with state-of-the-art algorithms in the 2D task on OpenLane. The project page is available at https://github.com/OpenPerceptionX/PersFormer_3DLane and OpenLane dataset is provided at https://github.com/OpenPerceptionX/OpenLane.

updated: Tue Jul 19 2022 10:00:22 GMT+0000 (UTC)

published: Mon Mar 21 2022 16:12:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト