Scan-based Semantic Segmentation of LiDAR Point Clouds: An Experimental Study

Larissa T. Triess; David Peter; Christoph B. Rist; J. Marius Zöllner

Autonomous vehicles need a semantic understanding of the three-dimensional world around them in order to reason about their environment. State-of-the-art methods use deep neural networks to predict a semantic class for each point in a LiDAR scan. A powerful and efficient way to process LiDAR measurements is to use two-dimensional, image-like projections. In this work, we perform a comprehensive experimental study of image-based semantic segmentation architectures for LiDAR point clouds. We demonstrate various techniques to boost performance and to reduce runtime and memory requirements. First, we examine the effect of network size and suggest that much faster inference times can be achieved at a very low cost to accuracy. Second, we introduce an improved point cloud projection technique that does not suffer from systematic occlusions, and we use a cyclic padding mechanism that provides context at the horizontal field-of-view boundaries. Third, we perform experiments with a soft Dice loss function that directly optimizes for the intersection-over-union metric. Finally, we propose a new kind of convolution layer with a reduced amount of weight-sharing along one of the two spatial dimensions, addressing the large difference in appearance along the vertical axis of a LiDAR scan. We propose a final combination of the above methods with which the model achieves an increase of 3.2% in mIoU segmentation performance over the baseline while requiring only 42% of the original inference time.
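
The abstract names two of these techniques, cyclic padding and a soft Dice loss, without giving their formulation. The following is a minimal NumPy sketch of how they are commonly written, not the authors' implementation. The function names, the (H, W, C) image layout, the smoothing constant `eps`, and the assumption that the projected scan covers a full 360° (so that the left and right image edges are physical neighbors) are all illustrative assumptions.

```python
import numpy as np


def cyclic_pad_horizontal(image, pad):
    """Pad a projected scan of shape (H, W, C) along the width axis.

    Assumption: the scan covers a full horizontal revolution, so the
    columns at the right edge are the true neighbors of the left edge.
    mode="wrap" fills the padding with columns from the opposite side.
    """
    return np.pad(image, ((0, 0), (pad, pad), (0, 0)), mode="wrap")


def soft_dice_loss(probs, onehot, eps=1e-6):
    """Soft Dice loss for N points and K classes.

    probs:  (N, K) predicted class probabilities (e.g. softmax output)
    onehot: (N, K) one-hot ground-truth labels
    Returns 1 minus the mean per-class soft Dice coefficient. This is a
    common formulation; the paper's exact variant may differ.
    """
    intersection = (probs * onehot).sum(axis=0)
    denominator = probs.sum(axis=0) + onehot.sum(axis=0)
    dice = (2.0 * intersection + eps) / (denominator + eps)
    return 1.0 - dice.mean()


if __name__ == "__main__":
    # Toy 2 x 4 single-channel "range image" with distinct column values.
    scan = np.arange(8, dtype=np.float32).reshape(2, 4, 1)
    print(cyclic_pad_horizontal(scan, pad=1)[0, :, 0])

    labels = np.eye(3)[[0, 1, 2, 1]]       # four points, three classes
    print(soft_dice_loss(labels, labels))  # loss for a perfect prediction
```

The wrap-around padding lets a convolution at one horizontal image border see the columns from the opposite border. The Dice coefficient D is related to the intersection-over-union by D = 2·IoU / (1 + IoU), which is why a Dice-based loss is a natural surrogate for an IoU metric.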

updated: Fri Sep 24 2021 07:28:13 GMT+0000 (UTC)

published: Mon Apr 06 2020 11:08:12 GMT+0000 (UTC)

arXiv
