Viewpoint Equivariance for Multi-View 3D Object Detection

Dian Chen; Jie Li; Vitor Guizilini; Rares Ambrus; Adrien Gaidon

マルチビュー 3D オブジェクト検出のための視点等価分散

視覚センサーからの 3D オブジェクト検出は、ロボットシステムの基礎となる機能です。最先端の方法は、マルチビューカメラ入力からのオブジェクトバウンディングボックスの推論とデコードに焦点を当てています。この作業では、3D シーンの理解と幾何学的学習におけるマルチビューの一貫性の不可欠な役割から直感を得ます。この目的のために、VEDet を紹介します。これは、3D マルチビュージオメトリを活用して、視点の認識と等価性によってローカリゼーションを改善する新しい 3D オブジェクト検出フレームワークです。 VEDet は、クエリベースのトランスフォーマーアーキテクチャを活用し、3D 遠近法ジオメトリからの位置エンコーディングで画像の特徴を増強することにより、3D シーンをエンコードします。出力レベルでビュー条件付きクエリを設計します。これにより、トレーニング中に複数の仮想フレームを生成して、マルチビューの一貫性を強制することで視点の等価性を学習できます。入力レベルで位置エンコーディングとして挿入され、損失レベルで正則化されたマルチビュージオメトリは、3D オブジェクト検出のための豊富な幾何学的キューを提供し、nuScenes ベンチマークで最先端のパフォーマンスをもたらします。コードとモデルは、https://github.com/TRI-ML/VEDet で入手できます。

3D object detection from visual sensors is a cornerstone capability of robotic systems. State-of-the-art methods focus on reasoning and decoding object bounding boxes from multi-view camera input. In this work we gain intuition from the integral role of multi-view consistency in 3D scene understanding and geometric learning. To this end, we introduce VEDet, a novel 3D object detection framework that exploits 3D multi-view geometry to improve localization through viewpoint awareness and equivariance. VEDet leverages a query-based transformer architecture and encodes the 3D scene by augmenting image features with positional encodings from their 3D perspective geometry. We design view-conditioned queries at the output level, which enables the generation of multiple virtual frames during training to learn viewpoint equivariance by enforcing multi-view consistency. The multi-view geometry injected at the input level as positional encodings and regularized at the loss level provides rich geometric cues for 3D object detection, leading to state-of-the-art performance on the nuScenes benchmark. The code and model are made available at https://github.com/TRI-ML/VEDet.

updated: Sat Mar 25 2023 19:56:41 GMT+0000 (UTC)

published: Sat Mar 25 2023 19:56:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト