DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception

Yunze Man; Liang-Yan Gui; Yu-Xiong Wang

DualCross：単眼BEV知覚のためのクロスモダリティクロスドメイン適応

トレーニングと展開の間のドメインギャップを埋めることと、複数のセンサーモダリティを組み込むことは、自動運転にとって挑戦的でありながら重要な 2 つのトピックです。既存の作業は、上記のトピックの 1 つだけに焦点を当てており、現実世界のシナリオに広く存在するドメインとモダリティの同時シフトを見落としています。ヨーロッパで収集されたマルチセンサーデータでトレーニングされたモデルは、利用可能な入力センサーのサブセットを使用してアジアで実行する必要がある場合があります。この作業では、1 つのドメインの LiDAR センサーから点群の知識を転送する、より堅牢な単眼鳥瞰図 (BEV) 知覚モデルの学習を容易にするクロスモダリティクロスドメイン適応フレームワークである DualCross を提案します。トレーニングフェーズ中に、別のドメインのカメラのみのテストシナリオに移行します。この作業は、クロスドメインクロスセンサーの知覚と野生の単眼 3D タスクへの適応の最初のオープンな分析をもたらします。幅広いドメインシフトの下で大規模なデータセットでアプローチをベンチマークし、さまざまなベースラインに対して最先端の結果を示します。

Closing the domain gap between training and deployment and incorporating multiple sensor modalities are two challenging yet critical topics for self-driving. Existing work only focuses on single one of the above topics, overlooking the simultaneous domain and modality shift which pervasively exists in real-world scenarios. A model trained with multi-sensor data collected in Europe may need to run in Asia with a subset of input sensors available. In this work, we propose DualCross, a cross-modality cross-domain adaptation framework to facilitate the learning of a more robust monocular bird's-eye-view (BEV) perception model, which transfers the point cloud knowledge from a LiDAR sensor in one domain during the training phase to the camera-only testing scenario in a different domain. This work results in the first open analysis of cross-domain cross-sensor perception and adaptation for monocular 3D tasks in the wild. We benchmark our approach on large-scale datasets under a wide range of domain shifts and show state-of-the-art results against various baselines.

updated: Fri May 05 2023 17:58:45 GMT+0000 (UTC)

published: Fri May 05 2023 17:58:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト