GeoSpark: Sparking up Point Cloud Segmentation with Geometry Clue

Zhening Huang; Xiaoyang Wu; Hengshuang Zhao; Lei Zhu; Shujun Wang; Georgios Hadjidemetriou; Ioannis Brilakis

GeoSpark: Geometry Clue を使用した点群セグメンテーションの起動

現在のポイントクラウドセグメンテーションアーキテクチャは、地域の近隣との情報の集約にほとんど依存しているため、長距離フィーチャモデリングが制限されているという問題があります。さらに、複数のスケールでポイントの特徴を学習するために、ほとんどの方法では、データに依存しないサンプリングアプローチを利用して、各段階の後にポイントの数を減らします。ただし、このようなサンプリング方法では、初期段階で小さなオブジェクトのポイントが破棄されることが多く、機能学習が不十分になります。これらの問題は、ガイダンスとして明示的なジオメトリの手がかりを導入することで軽減できると考えています。この目的のために、GeoSpark を提案します。GeoSpark は、Geometry の手がかりをネットワークに組み込み、機能の学習とダウンサンプリングを促進するプラグインモジュールです。 GeoSpark は、さまざまなバックボーンに簡単に統合できます。特徴集約では、ネットワークがローカルポイントと隣接するジオメトリパーティションの両方から学習できるようにすることで特徴モデリングを改善し、データに合わせた受容野を拡大します。さらに、GeoSpark はジオメトリパーティション情報を利用してダウンサンプリングプロセスをガイドします。このプロセスでは、一意の機能を持つポイントが保持され、冗長なポイントが融合されるため、ネットワーク全体でキーポイントがより適切に保持されます。 PointNet++、KPConv、PointTransformer などのさまざまなバックボーンに GeoSpark を追加した後、一貫した改善が見られました。特に、Point Transformer と統合した場合、当社の GeoSpark モジュールは、ScanNetv2 データセットで 74.7% の mIoU (4.1% 改善)、S3DIS Area 5 データセットで 71.5% の mIoU (1.1% 改善) を達成し、両方のベンチマークでトップにランクされました。コードとモデルは公開されます。

Current point cloud segmentation architectures suffer from limited long-range feature modeling, as they mostly rely on aggregating information with local neighborhoods. Furthermore, in order to learn point features at multiple scales, most methods utilize a data-agnostic sampling approach to decrease the number of points after each stage. Such sampling methods, however, often discard points for small objects in the early stages, leading to inadequate feature learning. We believe these issues are can be mitigated by introducing explicit geometry clues as guidance. To this end, we propose GeoSpark, a Plug-in module that incorporates Geometry clues into the network to Spark up feature learning and downsampling. GeoSpark can be easily integrated into various backbones. For feature aggregation, it improves feature modeling by allowing the network to learn from both local points and neighboring geometry partitions, resulting in an enlarged data-tailored receptive field. Additionally, GeoSpark utilizes geometry partition information to guide the downsampling process, where points with unique features are preserved while redundant points are fused, resulting in better preservation of key points throughout the network. We observed consistent improvements after adding GeoSpark to various backbones including PointNet++, KPConv, and PointTransformer. Notably, when integrated with Point Transformer, our GeoSpark module achieves a 74.7% mIoU on the ScanNetv2 dataset (4.1% improvement) and 71.5% mIoU on the S3DIS Area 5 dataset (1.1% improvement), ranking top on both benchmarks. Code and models will be made publicly available.

updated: Tue Mar 14 2023 23:30:46 GMT+0000 (UTC)

published: Tue Mar 14 2023 23:30:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト