Monocular 3D Object Detection using Multi-Stage Approaches with Attention and Slicing aided hyper inference

Abonia Sojasingarayar; Ashish Patel

Attention と Slicing を使用したハイパー推論を使用した多段階アプローチを使用した単眼 3D オブジェクト検出

3D オブジェクト検出は、オブジェクトのサイズ、方向、および世界での位置をキャプチャできるため、非常に重要です。その結果、この 3D 検出を、拡張現実 (AR)、自動運転車、人間と同じように世界を認識するロボット工学などの現実世界のアプリケーションで使用できるようになります。単眼 3D オブジェクト検出は、単一の 2D RGB 画像内のオブジェクトの周囲に 3D バウンディングボックスを描画するタスクです。これはローカリゼーションタスクですが、深度や他のセンサー、複数の画像などの追加情報はありません。単眼 3D オブジェクトの検出は、重要かつ挑戦的なタスクです。画像ベースの 2D オブジェクト検出の大幅な進歩を超えて、現実世界のオブジェクトの 3D 理解は、これまで広く調査されていない未解決の課題です。最も密接に関連する研究に加えて。

3D object detection is vital as it would enable us to capture objects' sizes, orientation, and position in the world. As a result, we would be able to use this 3D detection in real-world applications such as Augmented Reality (AR), self-driving cars, and robotics which perceive the world the same way we do as humans. Monocular 3D Object Detection is the task to draw 3D bounding box around objects in a single 2D RGB image. It is localization task but without any extra information like depth or other sensors or multiple images. Monocular 3D object detection is an important yet challenging task. Beyond the significant progress in image-based 2D object detection, 3D understanding of real-world objects is an open challenge that has not been explored extensively thus far. In addition to the most closely related studies.

updated: Thu Dec 22 2022 15:36:07 GMT+0000 (UTC)

published: Thu Dec 22 2022 15:36:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト