Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes

Yang You; Zelin Ye; Yujing Lou; Chengkun Li; Yong-Lu Li; Lizhuang Ma; Weiming Wang; Cewu Lu

正規投票：3Dシーンでのロバストな方向付けされたバウンディングボックス検出に向けて

センサーの進歩と点群の深層学習手法のおかげで、3Dオブジェクト検出が大きな注目を集めています。投票ネットのような現在の最先端の方法は、追加の多層パーセプトロンネットワークを使用して、オブジェクトの中心とボックスの向きに向けて直接オフセットを回帰します。回転分類が根本的に難しいため、オフセットと方向の両方の予測は正確ではありません。この作業では、直接オフセットをローカル正準座標（LCC）、ボックススケール、およびボックス方向に解きほぐします。 LCCとボックスのスケールのみが回帰され、ボックスの方向は正規の投票スキームによって生成されます。最後に、LCC対応の逆射影チェックアルゴリズムは、誤検知を排除して、生成された投票マップから境界ボックスを繰り返し切り取ります。私たちのモデルは、実際の点群スキャンの挑戦的な大規模データセットで最先端のパフォーマンスを実現します：ScanNet、SceneNN、それぞれ8.8および5.1mAPの改善。コードはhttps://github.com/qq456cvb/CanonicalVotingで入手できます。

3D object detection has attracted much attention thanks to the advances in sensors and deep learning methods for point clouds. Current state-of-the-art methods like VoteNet regress direct offset towards object centers and box orientations with an additional Multi-Layer-Perceptron network. Both their offset and orientation predictions are not accurate due to the fundamental difficulty in rotation classification. In the work, we disentangle the direct offset into Local Canonical Coordinates (LCC), box scales and box orientations. Only LCC and box scales are regressed while box orientations are generated by a canonical voting scheme. Finally, a LCC-aware back-projection checking algorithm iteratively cuts out bounding boxes from the generated vote maps, with the elimination of false positives. Our model achieves state-of-the-art performance on challenging large-scale datasets of real point cloud scans: ScanNet, SceneNN with 8.8 and 5.1 mAP improvement respectively. Code is available on https://github.com/qq456cvb/CanonicalVoting.

updated: Wed Mar 17 2021 03:02:13 GMT+0000 (UTC)

published: Tue Nov 24 2020 10:03:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト