Point Cloud Semantic Segmentation using Multi Scale Sparse Convolution Neural Network

Yunzheng Su

Point clouds are unordered, unstructured, and sparse. To address their unstructured nature, one class of solutions exploits the strong performance of convolutional neural networks in image processing and extracts point cloud features with two-dimensional convolutional neural networks: the three-dimensional information carried by the point cloud is projected to two dimensions, processed by a 2D convolutional neural network, and finally back-projected to three dimensions. Projecting 3D information to 2D and back inevitably loses some information, and the back-projection stage introduces category inconsistencies. Another solution is voxel-based point cloud segmentation, which divides the point cloud into a grid of small cells (voxels). However, because the point cloud is sparse, directly applying a 3D convolutional neural network inevitably wastes computing resources. In this paper, we propose a feature extraction module based on multi-scale ultra-sparse convolution and a feature selection module based on channel attention, and build a point cloud segmentation network framework on top of them. By introducing multi-scale sparse convolution, the network can capture richer feature information from convolution kernels of different sizes, improving point cloud segmentation results.
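The ideas named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the paper's network: the function names (`voxelize`, `sparse_conv`, `multi_scale`, `channel_attention`), the uniform kernel weights, and the parameter-free sigmoid gate are all assumptions made for illustration, standing in for learned weights and a learned attention module. The sketch shows why sparse convolution saves work (only occupied voxels are visited), how outputs from kernels of different sizes can be concatenated, and how a channel gate can reweight the resulting features.

```python
import math
from itertools import product

def voxelize(points, voxel_size):
    """Group (x, y, z, *features) points by integer voxel index.
    Only occupied voxels are stored; features in a voxel are averaged."""
    buckets = {}
    for x, y, z, *feat in points:
        key = (math.floor(x / voxel_size),
               math.floor(y / voxel_size),
               math.floor(z / voxel_size))
        buckets.setdefault(key, []).append(feat)
    return {k: [sum(c) / len(v) for c in zip(*v)] for k, v in buckets.items()}

def sparse_conv(voxels, kernel_size):
    """Convolve at occupied voxels only, skipping empty neighbours.
    Assumption: a uniform per-channel weight replaces learned weights."""
    r = kernel_size // 2
    offsets = list(product(range(-r, r + 1), repeat=3))
    w = 1.0 / len(offsets)  # placeholder weight, not learned
    out = {}
    for (i, j, k), feat in voxels.items():
        acc = [0.0] * len(feat)
        for di, dj, dk in offsets:
            nb = voxels.get((i + di, j + dj, k + dk))
            if nb is None:  # empty voxel: no computation spent on it
                continue
            for c, val in enumerate(nb):
                acc[c] += w * val
        out[(i, j, k)] = acc
    return out

def multi_scale(voxels, kernel_sizes=(3, 5)):
    """Concatenate per-voxel outputs of sparse convolutions of several sizes."""
    branches = [sparse_conv(voxels, ks) for ks in kernel_sizes]
    return {key: [v for b in branches for v in b[key]] for key in voxels}

def channel_attention(voxels):
    """Simplified channel gate: sigmoid of each channel's mean over voxels.
    Assumption: a real module would pass the pooled vector through learned layers."""
    n = len(voxels)
    channels = len(next(iter(voxels.values())))
    means = [sum(f[c] for f in voxels.values()) / n for c in range(channels)]
    gates = [1.0 / (1.0 + math.exp(-m)) for m in means]
    scaled = {k: [g * v for g, v in zip(gates, f)] for k, f in voxels.items()}
    return scaled, gates

# Hypothetical toy input: four points with one feature channel each.
points = [(0.1, 0.1, 0.1, 1.0), (0.2, 0.1, 0.1, 3.0),
          (1.1, 0.1, 0.1, 2.0), (5.0, 5.0, 5.0, 4.0)]
vox = voxelize(points, voxel_size=1.0)
features, gates = channel_attention(multi_scale(vox))
print(len(vox), "occupied voxels,", len(gates), "channels after fusion")
```

Because the voxel grid is stored as a dictionary keyed by occupied cells, the cost of each convolution scales with the number of occupied voxels rather than with the volume of the bounding grid, which is the inefficiency of dense 3D convolution that the abstract points out.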

updated: Mon May 09 2022 14:10:28 GMT+0000 (UTC)

published: Tue May 03 2022 15:01:20 GMT+0000 (UTC)

arXiv
