FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

Danila Rukhovich; Anna Vorontsova; Anton Konushin

FCAF3D：完全畳み込みアンカーフリーの3Dオブジェクト検出

最近、ロボット工学や拡張現実での有望なアプリケーションが、点群からの3Dオブジェクト検出にかなりの注目を集めています。この論文では、FCAF3Dを紹介します。これは、クラス初の完全畳み込みアンカーフリーの屋内3Dオブジェクト検出方法です。これは、点群のボクセル表現を使用し、スパース畳み込みでボクセルを処理する、シンプルでありながら効果的な方法です。 FCAF3Dは、単一の完全畳み込みフィードフォワードパスにより、最小限のランタイムで大規模なシーンを処理できます。既存の3Dオブジェクト検出方法は、オブジェクトのジオメトリについて事前に仮定しており、それらの一般化能力を制限すると主張します。以前の仮定を取り除くために、純粋にデータ駆動型の方法でより良い結果を得ることができる、方向付けられた境界ボックスの新しいパラメータ化を提案します。提案された方法は、ScanNet V2（+4.5）、SUN RGB-D（+3.5）、およびS3DIS（+20.5）データセットでmAP @ 0.5に関して最先端の3Dオブジェクト検出結果を実現します。コードとモデルはhttps://github.com/samsunglabs/fcaf3dで入手できます。

Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective method that uses a voxel representation of a point cloud and processes voxels with sparse convolutions. FCAF3D can handle large-scale scenes with minimal runtime through a single fully convolutional feed-forward pass. Existing 3D object detection methods make prior assumptions on the geometry of objects, and we argue that it limits their generalization ability. To get rid of any prior assumptions, we propose a novel parametrization of oriented bounding boxes that allows obtaining better results in a purely data-driven way. The proposed method achieves state-of-the-art 3D object detection results in terms of mAP@0.5 on ScanNet V2 (+4.5), SUN RGB-D (+3.5), and S3DIS (+20.5) datasets. The code and models are available at https://github.com/samsunglabs/fcaf3d.

updated: Wed Dec 01 2021 07:28:52 GMT+0000 (UTC)

published: Wed Dec 01 2021 07:28:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト