3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection

Alexander Lehner; Stefano Gasperini; Alvaro Marcos-Ramiro; Michael Schmidt; Mohammad-Ali Nikouei Mahani; Nassir Navab; Benjamin Busam; Federico Tombari

3D-VField：3Dオブジェクト検出におけるドメイン一般化のための点群の敵対的増強

点群での3Dオブジェクト検出は、ポイント間の幾何学的関係に依存しているため、非標準のオブジェクト形状は、メソッドの検出機能を妨げる可能性があります。ただし、セーフティクリティカルな設定では、ドメイン外およびロングテールのサンプルに対する堅牢性が、損傷した車や珍しい車の誤検出などの危険な問題を回避するための基本です。この作業では、トレーニング中に点群を変形することにより、3Dオブジェクト検出器のドメイン外データへの一般化を大幅に改善します。これは、3D-VFieldを使用して実現します。これは、敵対的な方法で学習したベクトル場を介してオブジェクトをもっともらしく変形する新しいデータ拡張方法です。私たちのアプローチでは、3Dポイントをセンサービュー光線に沿ってスライドするように制限しますが、それらを追加したり削除したりすることはありません。得られたベクトルは転送可能で、サンプルに依存せず、形状とオクルージョンを保持します。 KITTIなどの標準データセットでのみトレーニングを行っているにもかかわらず、ベクトルフィールドを拡張すると、さまざまな形状のオブジェクトやシーンへの一般化が大幅に向上します。この目的に向けて、CrashDを提案し、共有します。これは、さまざまなクラッシュシナリオを使用した、現実的な損傷車と希少車の合成データセットです。 KITTI、Waymo、CrashD、SUN RGB-Dに関する広範な実験により、屋内と屋外の両方のシーンで、ドメイン外のデータ、さまざまなモデルとセンサー、つまりLiDARとToFカメラに対する技術の一般化が示されています。 CrashDデータセットは、https：//crashd-cars.github.ioで入手できます。

As 3D object detection on point clouds relies on the geometrical relationships between the points, non-standard object shapes can hinder a method's detection capability. However, in safety-critical settings, robustness to out-of-domain and long-tail samples is fundamental to circumvent dangerous issues, such as the misdetection of damaged or rare cars. In this work, we substantially improve the generalization of 3D object detectors to out-of-domain data by deforming point clouds during training. We achieve this with 3D-VField: a novel data augmentation method that plausibly deforms objects via vector fields learned in an adversarial fashion. Our approach constrains 3D points to slide along their sensor view rays while neither adding nor removing any of them. The obtained vectors are transferable, sample-independent and preserve shape and occlusions. Despite training only on a standard dataset, such as KITTI, augmenting with our vector fields significantly improves the generalization to differently shaped objects and scenes. Towards this end, we propose and share CrashD: a synthetic dataset of realistic damaged and rare cars, with a variety of crash scenarios. Extensive experiments on KITTI, Waymo, our CrashD and SUN RGB-D show the generalizability of our techniques to out-of-domain data, different models and sensors, namely LiDAR and ToF cameras, for both indoor and outdoor scenes. Our CrashD dataset is available at https://crashd-cars.github.io.

updated: Tue May 03 2022 09:37:49 GMT+0000 (UTC)

published: Thu Dec 09 2021 08:50:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト