CenterNet3D: An Anchor Free Object Detector for Point Cloud

Guojun Wang; Jian Wu; Bin Tian; Siyu Teng; Long Chen; Dongpu Cao

CenterNet3D：点群用のアンカーフリーオブジェクト検出器

点群からの正確で高速な3Dオブジェクト検出は、自動運転の重要なタスクです。既存の1ステージ3Dオブジェクト検出方法は、リアルタイムのパフォーマンスを実現できますが、非効率的で追加の後処理が必要なアンカーベースの検出器が主流です。このホワイトペーパーでは、アンカーを削除し、オブジェクトを単一の点、つまりバウンディングボックスの中心点としてモデル化します。中心点に基づいて、アンカーなしで3Dオブジェクト検出を実行するアンカーフリーのCenterNet3Dネットワークを提案します。 CenterNet3Dは、キーポイント推定を使用して中心点を見つけ、3Dバウンディングボックスを直接回帰します。ただし、点群には固有のスパース性があるため、3Dオブジェクトの中心点は空の空間にある可能性が高く、正確な境界を推定することは困難です。この問題を解決するために、オブジェクトの境界により多くの注意を払うようにCNNバックボーンを強制する追加のコーナー注意モジュールを提案します。さらに、1ステージ検出器は、予測された境界ボックスと対応する分類信頼度の間の不一致に悩まされていることを考慮して、信頼度を予測された境界ボックスに合わせるための効率的なキーポイントセンシティブワーピング操作を開発します。私たちが提案するCenterNet3Dは、最大抑制がないため、より効率的でシンプルになります。 CenterNet3Dは、広く使用されているKITTIデータセットとより挑戦的なnuScenesデータセットで評価します。私たちの方法は、すべての最先端のアンカーベースの1段階方法よりも優れており、2段階方法と同等のパフォーマンスを備えています。推論速度は20FPSで、最高の速度と精度のトレードオフを実現します。ソースコードはhttps://github.com/wangguojun2018/CenterNet3dでリリースされます。

Accurate and fast 3D object detection from point clouds is a key task in autonomous driving. Existing one-stage 3D object detection methods can achieve real-time performance, however, they are dominated by anchor-based detectors which are inefficient and require additional post-processing. In this paper, we eliminate anchors and model an object as a single point--the center point of its bounding box. Based on the center point, we propose an anchor-free CenterNet3D network that performs 3D object detection without anchors. Our CenterNet3D uses keypoint estimation to find center points and directly regresses 3D bounding boxes. However, because inherent sparsity of point clouds, 3D object center points are likely to be in empty space which makes it difficult to estimate accurate boundaries. To solve this issue, we propose an extra corner attention module to enforce the CNN backbone to pay more attention to object boundaries. Besides, considering that one-stage detectors suffer from the discordance between the predicted bounding boxes and corresponding classification confidences, we develop an efficient keypoint-sensitive warping operation to align the confidences to the predicted bounding boxes. Our proposed CenterNet3D is non-maximum suppression free which makes it more efficient and simpler. We evaluate CenterNet3D on the widely used KITTI dataset and more challenging nuScenes dataset. Our method outperforms all state-of-the-art anchor-based one-stage methods and has comparable performance to two-stage methods as well. It has an inference speed of 20 FPS and achieves the best speed and accuracy trade-off. Our source code will be released at https://github.com/wangguojun2018/CenterNet3d.

updated: Mon Oct 25 2021 14:49:24 GMT+0000 (UTC)

published: Mon Jul 13 2020 13:53:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト