Semantics-aware Multi-modal Domain Translation: From LiDAR Point Clouds to Panoramic Color Images

Tiago Cortinhal; Fatih Kurnaz; Eren Aksoy

In this work, we present a simple yet effective framework to address the domain translation problem between different sensor modalities with unique data formats. By relying only on the semantics of the scene, our modular generative framework can, for the first time, synthesize a panoramic color image from a given full 3D LiDAR point cloud. The framework starts with semantic segmentation of the point cloud, which is first projected onto a spherical surface. The same semantic segmentation is applied to the corresponding camera image. Next, our new conditional generative model adversarially learns to translate the predicted LiDAR segment maps to their camera image counterparts. Finally, the generated image segments are processed to render the panoramic scene images. We provide a thorough quantitative evaluation on the SemanticKITTI dataset and show that our proposed framework outperforms other strong baseline models. Our source code is available at https://github.com/halmstad-University/TITAN-NET
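The first step of the pipeline, projecting the point cloud onto a spherical surface, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `spherical_projection`, the 64 x 2048 image size, and the vertical field of view (+3 to -25 degrees, typical of a 64-beam sensor) are assumptions chosen for illustration. Each point is mapped to a pixel by its yaw and pitch angles, and the nearest point wins when several fall into the same pixel.

```python
import math


def spherical_projection(points, height=64, width=2048,
                         fov_up_deg=3.0, fov_down_deg=-25.0):
    """Project (x, y, z) points onto a height x width range image.

    Image size and field of view are assumed values, not taken from
    the paper. Empty pixels hold -1.0.
    """
    fov_down = math.radians(fov_down_deg)
    fov = math.radians(fov_up_deg) - fov_down  # total vertical field of view

    range_image = [[-1.0] * width for _ in range(height)]
    for x, y, z in points:
        depth = math.sqrt(x * x + y * y + z * z)
        if depth == 0.0:
            continue  # a point at the sensor origin has no direction

        yaw = math.atan2(y, x)                                # horizontal angle
        pitch = math.asin(max(-1.0, min(1.0, z / depth)))     # vertical angle

        # Normalize the angles to pixel coordinates.
        col = int(0.5 * (1.0 - yaw / math.pi) * width)
        row = int((1.0 - (pitch - fov_down) / fov) * height)

        # Clamp points that fall outside the assumed field of view.
        col = min(max(col, 0), width - 1)
        row = min(max(row, 0), height - 1)

        # Keep the closest point when several land in the same pixel.
        current = range_image[row][col]
        if current < 0.0 or depth < current:
            range_image[row][col] = depth
    return range_image
```

In a setup like the one described, a segmentation network would then operate on this 2D grid rather than on the raw, unordered points; real pipelines usually store extra channels (coordinates, intensity) per pixel and vectorize the loop.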

updated: Sat Jun 26 2021 08:52:17 GMT+0000 (UTC)

published: Sat Jun 26 2021 08:52:17 GMT+0000 (UTC)

arXiv
