Semantic Segmentation for Point Cloud Scenes via Dilated Graph Feature Aggregation and Pyramid Decoders

Yongqiang Mao; Xian Sun; Wenhui Diao; Kaiqiang Chen; Zonghao Guo; Xiaonan Lu; Kun Fu

拡張グラフ特徴集約とピラミッドデコーダーによる点群シーンのセマンティックセグメンテーション

点群のセマンティックセグメンテーションは、各ポイントのカテゴリを密に予測することにより、シーンの包括的な理解を生み出します。受容野が単一であるため、点群のセマンティックセグメンテーションは、複数の受容野の特徴を表現するのに依然として困難であり、同様の空間構造を持つインスタンスの誤分類を引き起こします。この論文では、ピラミッドデコーダーを介して計算されたマルチベーシス集約損失（MALoss）によって導かれる、拡張グラフ特徴集約（DGFA）に根ざしたグラフ畳み込みネットワークDGFA-Netを提案します。マルチ受容野の特徴を構成するために、提案された拡張グラフ畳み込み（DGConv）を基本的な構成要素として採用するDGFAは、さまざまな受容領域を持つ拡張グラフをキャプチャすることにより、マルチスケールの特徴表現を集約するように設計されています。異なる解像度のポイントセットを計算ベースとして受容野情報にペナルティを課すことを同時に検討することにより、受容野ベースの多様性のためにMALossによって駆動されるピラミッドデコーダーを紹介します。これらの2つの側面を組み合わせることで、DGFA-Netは、同様の空間構造を持つインスタンスのセグメンテーションパフォーマンスを大幅に向上させます。 S3DIS、ShapeNetPart、Toronto-3Dでの実験では、DGFA-Netがベースラインアプローチを上回り、新しい最先端のセグメンテーションパフォーマンスを達成していることが示されています。

Semantic segmentation of point clouds generates comprehensive understanding of scenes through densely predicting the category for each point. Due to the unicity of receptive field, semantic segmentation of point clouds remains challenging for the expression of multi-receptive field features, which brings about the misclassification of instances with similar spatial structures. In this paper, we propose a graph convolutional network DGFA-Net rooted in dilated graph feature aggregation (DGFA), guided by multi-basis aggregation loss (MALoss) calculated through Pyramid Decoders. To configure multi-receptive field features, DGFA which takes the proposed dilated graph convolution (DGConv) as its basic building block, is designed to aggregate multi-scale feature representation by capturing dilated graphs with various receptive regions. By simultaneously considering penalizing the receptive field information with point sets of different resolutions as calculation bases, we introduce Pyramid Decoders driven by MALoss for the diversity of receptive field bases. Combining these two aspects, DGFA-Net significantly improves the segmentation performance of instances with similar spatial structures. Experiments on S3DIS, ShapeNetPart and Toronto-3D show that DGFA-Net outperforms the baseline approach, achieving a new state-of-the-art segmentation performance.

updated: Mon Aug 01 2022 04:22:29 GMT+0000 (UTC)

published: Mon Apr 11 2022 08:41:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト