Exploiting Local Geometry for Feature and Graph Construction for Better 3D Point Cloud Processing with Graph Neural Networks

Siddharth Srivastava; Gaurav Sharma

グラフニューラルネットワークによるより良い3D点群処理のための特徴とグラフ構築のためのローカルジオメトリの活用

3Dポイントクラウド処理のためのグラフニューラルネットワーク（GNN）の一般的なフレームワーク内で、ポイント表現とローカル近傍グラフ構築のシンプルで効果的な改善を提案します。最初の貢献として、点の重要な局所幾何学的情報で頂点表現を補強し、続いてMLPを使用して非線形射影を行うことを提案します。 2番目の貢献として、3D点群のGNNのグラフ構築を改善することを提案します。既存の方法は、局所近傍グラフを構築するためのk-nnベースのアプローチで機能します。シーンの一部の領域でセンサーによる高密度サンプリングの場合、カバレッジの低下につながる可能性があると主張します。提案された方法は、そのような問題に対抗し、そのような場合のカバレッジを改善することを目的としています。従来のGNNは、頂点に幾何学的解釈がない可能性がある一般的なグラフで機能するように設計されているため、両方の提案は、3D点群の幾何学的性質を組み込むために一般的なグラフを拡張するものと見なします。シンプルでありながら、複数の挑戦的なベンチマーク、比較的クリーンなCADモデル、および実世界のノイズの多いスキャンを使用して、提案された方法が3D分類（ModelNet40）、パーツセグメンテーション（ShapeNet）のベンチマークで最先端の結果を達成することを示します。およびセマンティックセグメンテーション（Stanford 3D Indoor Scenes Dataset）。また、提案されたネットワークがより高速なトレーニング収束、つまり分類のエポックを約40％削減することも示しています。プロジェクトの詳細はhttps://siddharthsrivastava.github.io/publication/geomgcnn/で入手できます。

We propose simple yet effective improvements in point representations and local neighborhood graph construction within the general framework of graph neural networks (GNNs) for 3D point cloud processing. As a first contribution, we propose to augment the vertex representations with important local geometric information of the points, followed by nonlinear projection using a MLP. As a second contribution, we propose to improve the graph construction for GNNs for 3D point clouds. The existing methods work with a k-nn based approach for constructing the local neighborhood graph. We argue that it might lead to reduction in coverage in case of dense sampling by sensors in some regions of the scene. The proposed methods aims to counter such problems and improve coverage in such cases. As the traditional GNNs were designed to work with general graphs, where vertices may have no geometric interpretations, we see both our proposals as augmenting the general graphs to incorporate the geometric nature of 3D point clouds. While being simple, we demonstrate with multiple challenging benchmarks, with relatively clean CAD models, as well as with real world noisy scans, that the proposed method achieves state of the art results on benchmarks for 3D classification (ModelNet40) , part segmentation (ShapeNet) and semantic segmentation (Stanford 3D Indoor Scenes Dataset). We also show that the proposed network achieves faster training convergence, i.e. ~40% less epochs for classification. The project details are available at https://siddharthsrivastava.github.io/publication/geomgcnn/

updated: Sun Mar 28 2021 21:34:59 GMT+0000 (UTC)

published: Sun Mar 28 2021 21:34:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト