Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics

Ancheng Lin; Jun Li

High-quality estimation of surface normals can help reduce ambiguity in many geometry understanding problems, such as collision avoidance and occlusion inference. This paper presents a technique for estimating surface normals from 3D point clouds and 2D colour images. We have developed a transformer neural network that learns to utilise hybrid information from visual semantic and 3D geometric data, together with effective learning strategies. Experiments support that the proposed method fuses this information more effectively than existing methods. We have also built a simulation environment of outdoor traffic scenes in a 3D rendering engine to obtain annotated data for training the normal estimator. The model trained on synthetic data is tested on real scenes in the KITTI dataset. Subsequent tasks built upon the normal directions estimated on the KITTI dataset show that the proposed estimator has an advantage over existing methods.

updated: Sat Nov 19 2022 03:55:09 GMT+0000 (UTC)

published: Sat Nov 19 2022 03:55:09 GMT+0000 (UTC)

arXiv
