Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes

Yang You; Zelin Ye; Yujing Lou; Chengkun Li; Yong-Lu Li; Lizhuang Ma; Weiming Wang; Cewu Lu

正規投票：3Dシーンでのロバストな方向付けされたバウンディングボックス検出に向けて

センサーの進歩と点群の深層学習手法のおかげで、3Dオブジェクト検出が大きな注目を集めています。投票ネットのような現在の最先端の方法は、追加の多層パーセプトロンネットワークを使用して、オブジェクトの中心とボックスの向きに向けて直接オフセットを回帰します。回転分類が根本的に難しいため、オフセットと方向の予測はどちらも正確ではありません。この作業では、直接オフセットをローカル正準座標（LCC）、ボックススケール、およびボックスの向きに解きほぐします。 LCCとボックススケールのみがリグレッションされ、ボックスの向きは正規の投票スキームによって生成されます。最後に、LCC対応の逆投影チェックアルゴリズムは、誤検知を排除して、生成された投票マップから境界ボックスを繰り返し切り出します。私たちのモデルは、ScanNet、SceneNN、SUNRGB-Dの3つの標準的な実世界のベンチマークで最先端のパフォーマンスを実現しています。私たちのコードはhttps://github.com/qq456cvb/CanonicalVotingで入手できます。

3D object detection has attracted much attention thanks to the advances in sensors and deep learning methods for point clouds. Current state-of-the-art methods like VoteNet regress direct offset towards object centers and box orientations with an additional Multi-Layer-Perceptron network. Both their offset and orientation predictions are not accurate due to the fundamental difficulty in rotation classification. In the work, we disentangle the direct offset into Local Canonical Coordinates (LCC), box scales and box orientations. Only LCC and box scales are regressed, while box orientations are generated by a canonical voting scheme. Finally, an LCC-aware back-projection checking algorithm iteratively cuts out bounding boxes from the generated vote maps, with the elimination of false positives. Our model achieves state-of-the-art performance on three standard real-world benchmarks: ScanNet, SceneNN and SUN RGB-D. Our code is available on https://github.com/qq456cvb/CanonicalVoting.

updated: Wed Mar 09 2022 08:18:00 GMT+0000 (UTC)

published: Tue Nov 24 2020 10:03:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト