FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection

Tai Wang; Xinge Zhu; Jiangmiao Pang; Dahua Lin

FCOS3D：完全畳み込み1ステージ単眼3Dオブジェクト検出

単眼3D物体検出は、低コストという利点を考慮すると、自動運転にとって重要なタスクです。これは、主に深度情報の欠如に反映される固有の不適切な特性のため、従来の2Dケースよりもはるかに困難です。 2D検出の最近の進歩は、この問題をよりよく解決する機会を提供します。ただし、この3Dタスクで一般的に適合された2D検出器を機能させることは簡単ではありません。この論文では、完全畳み込み単段検波器に基づいて構築された手法でこの問題を研究し、一般的なフレームワークFCOS3Dを提案します。具体的には、最初に一般的に定義されている7-DoF 3Dターゲットを画像ドメインに変換し、それらを2Dおよび3D属性として分離します。次に、オブジェクトは2Dスケールを考慮してさまざまな機能レベルに分散され、トレーニング手順の投影された3D中心に従ってのみ割り当てられます。さらに、中心性は、3Dターゲット定式化に適合するように3D中心に基づく2Dガウス分布で再定義されます。これらすべてにより、このフレームワークはシンプルでありながら効果的であり、2D検出または2D-3D対応の事前情報が排除されます。私たちのソリューションは、NeurIPS2020のnuScenes3D検出チャレンジにおけるすべての視覚のみの方法の中で1位を達成しています。コードとモデルはhttps://github.com/open-mmlab/mmdetection3dでリリースされています。

Monocular 3D object detection is an important task for autonomous driving considering its advantage of low cost. It is much more challenging than conventional 2D cases due to its inherent ill-posed property, which is mainly reflected in the lack of depth information. Recent progress on 2D detection offers opportunities to better solving this problem. However, it is non-trivial to make a general adapted 2D detector work in this 3D task. In this paper, we study this problem with a practice built on a fully convolutional single-stage detector and propose a general framework FCOS3D. Specifically, we first transform the commonly defined 7-DoF 3D targets to the image domain and decouple them as 2D and 3D attributes. Then the objects are distributed to different feature levels with consideration of their 2D scales and assigned only according to the projected 3D-center for the training procedure. Furthermore, the center-ness is redefined with a 2D Gaussian distribution based on the 3D-center to fit the 3D target formulation. All of these make this framework simple yet effective, getting rid of any 2D detection or 2D-3D correspondence priors. Our solution achieves 1st place out of all the vision-only methods in the nuScenes 3D detection challenge of NeurIPS 2020. Code and models are released at https://github.com/open-mmlab/mmdetection3d.

updated: Fri Sep 24 2021 07:40:56 GMT+0000 (UTC)

published: Thu Apr 22 2021 09:35:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト