Walking Your LiDOG: A Journey Through Multiple Domains for LiDAR Semantic Segmentation

Cristiano Saltori; Aljoša Ošep; Elisa Ricci; Laura Leal-Taixé

LiDOG を歩く: LiDAR セマンティックセグメンテーションのための複数のドメインへの旅

多様な環境で安全に動作できるロボットを展開する機能は、具現化されたインテリジェントエージェントを開発するために重要です。コミュニティとして、私たちはドメイン内 LiDAR セマンティックセグメンテーションで大きな進歩を遂げました。しかし、これらの方法はドメイン間で一般化されていますか?この質問に答えるために、LiDAR セマンティックセグメンテーション (DG-LSS) のドメイン一般化 (DG) を研究するための最初の実験セットアップを設計します。私たちの結果は、クロスドメイン設定で評価されたメソッド間に大きなギャップがあることを確認しています。たとえば、ソースデータセット (SemanticKITTI) でトレーニングされたモデルは、ターゲットデータで 26.53 mIoU を取得しますが、ターゲットでトレーニングされたモデルでは 48.49 mIoU が取得されます。ドメイン (nuScenes)。このギャップに取り組むために、DG-LSS 用に特別に設計された最初の方法を提案します。これは、ターゲットドメインで 34.88 mIoU を取得し、すべてのベースラインよりも優れています。私たちの方法は、点群の鳥瞰図を分類することを学習する追加の密な 2D 畳み込みデコーダーを使用して、疎な畳み込みエンコーダーデコーダー 3D セグメンテーションネットワークを拡張します。この単純な補助タスクにより、3D ネットワークは、センサーの配置の変化と解像度に対して堅牢で、ドメイン間で転送可能な機能を学習するようになります。この作業により、コミュニティがこのようなクロスドメイン条件で将来のモデルを開発および評価するように促すことを目指しています。

The ability to deploy robots that can operate safely in diverse environments is crucial for developing embodied intelligent agents. As a community, we have made tremendous progress in within-domain LiDAR semantic segmentation. However, do these methods generalize across domains? To answer this question, we design the first experimental setup for studying domain generalization (DG) for LiDAR semantic segmentation (DG-LSS). Our results confirm a significant gap between methods, evaluated in a cross-domain setting: for example, a model trained on the source dataset (SemanticKITTI) obtains 26.53 mIoU on the target data, compared to 48.49 mIoU obtained by the model trained on the target domain (nuScenes). To tackle this gap, we propose the first method specifically designed for DG-LSS, which obtains 34.88 mIoU on the target domain, outperforming all baselines. Our method augments a sparse-convolutional encoder-decoder 3D segmentation network with an additional, dense 2D convolutional decoder that learns to classify a birds-eye view of the point cloud. This simple auxiliary task encourages the 3D network to learn features that are robust to sensor placement shifts and resolution, and are transferable across domains. With this work, we aim to inspire the community to develop and evaluate future models in such cross-domain conditions.

updated: Tue Aug 29 2023 10:08:24 GMT+0000 (UTC)

published: Sun Apr 23 2023 17:43:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト