Group Equivariant BEV for 3D Object Detection

Hongwei Liu; Jian Yang; Jianfeng Zhang; Dongheng Shao; Jielong Guo; Shaobo Li; Xuan Tang; Xian Wei

3D オブジェクト検出のためのグループ同変 BEV

最近、3D オブジェクト検出が大きな注目を集め、実際の道路シナリオで継続的な改善を実現しています。環境情報は、関心のあるオブジェクトを検出するために、単一のセンサーまたはマルチセンサーフュージョンから収集されます。ただし、現在の 3D オブジェクト検出アプローチのほとんどは、動的な運転シーンを考慮するのではなく、オブジェクトの検出精度を向上させるための高度なネットワークアーキテクチャの開発に焦点を当てており、車両に搭載されたセンサーから収集されたデータにはさまざまな摂動機能が含まれています。その結果、既存の研究では摂動の問題にまだ取り組むことができません。この問題を解決するために、BEV フュージョンオブジェクト検出ネットワークにグループ等価の概念を導入する、グループ等価理論に基づくグループ等価鳥瞰図ネットワーク (GeqBevNet) を提案します。グループ同変ネットワークは、BEV レベルの回転同変特徴抽出を容易にするために融合 BEV 特徴マップに埋め込まれているため、平均方向誤差が低くなります。 GeqBevNet の有効性を実証するために、ネットワークは、mAOE を 0.325 に減らすことができる nuScenes 検証データセットで検証されます。実験結果は、GeqBevNet が実際の道路シーンの 3D オブジェクト検出でより多くの回転同変特徴を抽出し、オブジェクトの向き予測のパフォーマンスを向上できることを示しています。

Recently, 3D object detection has attracted significant attention and achieved continuous improvement in real road scenarios. The environmental information is collected from a single sensor or multi-sensor fusion to detect interested objects. However, most of the current 3D object detection approaches focus on developing advanced network architectures to improve the detection precision of the object rather than considering the dynamic driving scenes, where data collected from sensors equipped in the vehicle contain various perturbation features. As a result, existing work cannot still tackle the perturbation issue. In order to solve this problem, we propose a group equivariant bird's eye view network (GeqBevNet) based on the group equivariant theory, which introduces the concept of group equivariant into the BEV fusion object detection network. The group equivariant network is embedded into the fused BEV feature map to facilitate the BEV-level rotational equivariant feature extraction, thus leading to lower average orientation error. In order to demonstrate the effectiveness of the GeqBevNet, the network is verified on the nuScenes validation dataset in which mAOE can be decreased to 0.325. Experimental results demonstrate that GeqBevNet can extract more rotational equivariant features in the 3D object detection of the actual road scene and improve the performance of object orientation prediction.

updated: Wed Apr 26 2023 09:00:31 GMT+0000 (UTC)

published: Wed Apr 26 2023 09:00:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト