Scalable SoftGroup for 3D Instance Segmentation on Point Clouds

Thang Vu; Kookhoi Kim; Tung M. Luu; Thanh Nguyen; Junyeong Kim; Chang D. Yoo

点群での 3D インスタンスセグメンテーションのためのスケーラブルな SoftGroup

このホワイトペーパーでは、正確でスケーラブルな 3D インスタンスセグメンテーションのために、SoftGroup と呼ばれるネットワークを検討します。既存の最先端の方法は、厳密なセマンティック予測を生成し、続いてグループ化してインスタンスのセグメンテーション結果を取得します。ただし、難しい決定に起因するエラーはグループ化に伝播し、その結果、予測されたインスタンスとグラウンドトゥルースとの重なりが少なくなり、実質的な誤検知が発生します。前述の問題に対処するために、SoftGroup では各ポイントを複数のクラスに関連付けて、セマンティック予測エラーに起因する問題を軽減し、偽陽性インスタンスをバックグラウンドとして分類することを学習することで抑制します。スケーラビリティに関しては、既存の高速な方法では、大規模なシーンで数十秒程度の計算時間が必要であり、これは不十分であり、リアルタイムに適用するにはほど遠いものです。私たちの発見は、グループ化の前提条件として機能する k-Nearest Neighbor (k-NN) モジュールが計算上のボトルネックを導入することです。 SoftGroup は、SoftGroup++ と呼ばれるこの計算上のボトルネックを解決するために拡張されています。提案された SoftGroup++ は、octree k-NN で時間の複雑さを軽減し、クラス認識ピラミッドスケーリングと後期デボクセル化で検索スペースを削減します。さまざまな屋内および屋外のデータセットに関する実験結果は、提案された SoftGroup および SoftGroup++ の有効性と一般性を示しています。それらのパフォーマンスは、AP_50 に関して、最強のベースラインを大幅に (6% ～ 16%) 上回っています。大規模なシーンを含むデータセットでは、SoftGroup++ は SoftGroup と比較して平均で 6 倍の速度向上を実現します。さらに、SoftGroup を拡張して、オブジェクト検出とパノプティックセグメンテーションを実行し、既存の方法を大幅に改善することができます。ソースコードとトレーニング済みモデルは、https://github.com/thangvubk/SoftGroup で入手できます。

This paper considers a network referred to as SoftGroup for accurate and scalable 3D instance segmentation. Existing state-of-the-art methods produce hard semantic predictions followed by grouping to obtain instance segmentation results. However, the errors stemming from hard decisions propagate into grouping that results in low overlaps of the predicted instances with the ground truth and substantial false positives. To address the aforementioned problems, SoftGroup allows each point to be associated with multiple classes to mitigate the problem stemming from semantic prediction errors and suppresses false positive instances by learning to categorize them as background. Regarding scalability, the existing fast methods require computational time on the order of tens of seconds on large-scale scenes, which is unsatisfactory and far from applicable for real-time. Our finding is that the k-Nearest Neighbor (k-NN) module, which serves as the prerequisite of grouping, introduces computational bottleneck. SoftGroup is extended to resolve this computational bottleneck, which is referred to as SoftGroup++. The proposed SoftGroup++ reduces time complexity with octree k-NN and reduces search space with class-aware pyramid scaling and late devoxelization. Experimental results on various indoor and outdoor datasets demonstrate the efficacy and generality of the proposed SoftGroup and SoftGroup++. Their performances surpass the strongest baseline by a large margin (6% ∼ 16%) in terms of AP_50. On datasets with large-scale scenes, SoftGroup++ achieves 6× speed boost on average compared to SoftGroup. Furthermore, SoftGroup can be extended to perform object detection and panoptic segmentation with nontrivial improvements over existing methods. The source code and trained models are available at https://github.com/thangvubk/SoftGroup.

updated: Sun Nov 13 2022 04:57:35 GMT+0000 (UTC)

published: Sat Sep 17 2022 06:36:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト