PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Guocheng Qian; Yuchen Li; Houwen Peng; Jinjie Mai; Hasan Abed Al Kader Hammoud; Mohamed Elhoseiny; Bernard Ghanem

PointNeXt: 改善されたトレーニングとスケーリング戦略による PointNet++ の再検討

PointNet++ は、点群を理解するための最も影響力のあるニューラルアーキテクチャの 1 つです。 PointNet++ の精度は、PointMLP や Point Transformer などの最近のネットワークによって大幅に凌駕されていますが、パフォーマンスの向上の大部分は、トレーニング戦略の改善、つまりデータの拡張と最適化の手法、およびアーキテクチャ上の問題ではなくモデルサイズの増加によるものであることがわかりました。イノベーション。したがって、PointNet++ の潜在能力はまだ完全に解明されていません。この作業では、モデルのトレーニングとスケーリング戦略の体系的な研究を通じて従来の PointNet++ を再検討し、2 つの主要な貢献を提供します。まず、PointNet++ のパフォーマンスを大幅に向上させる一連の改善されたトレーニング戦略を提案します。たとえば、アーキテクチャを変更することなく、ScanObjectNN オブジェクト分類での PointNet++ の全体的な精度 (OA) を 77.9% から 86.1% に上げることができ、最先端の PointMLP よりも優れていることを示しています。次に、反転残差ボトルネック設計と分離可能な MLP を PointNet++ に導入して、効率的かつ効果的なモデルのスケーリングを可能にし、PointNets の次のバージョンである PointNeXt を提案します。 PointNeXt は柔軟にスケールアップでき、3D 分類タスクとセグメンテーションタスクの両方で最先端の方法よりも優れています。分類では、PointNeXt は ScanObjectNN で 87.7 の全体的な精度に達し、PointMLP を 2.3% 上回っていますが、推論は 10 倍高速です。セマンティックセグメンテーションの場合、PointNeXt は S3DIS (6 分割交差検証) で 74.9% の平均 IoU という新しい最先端のパフォーマンスを確立し、最近の Point Transformer よりも優れています。コードとモデルは、https://github.com/guochengqian/pointnext で入手できます。

PointNet++ is one of the most influential neural architectures for point cloud understanding. Although the accuracy of PointNet++ has been largely surpassed by recent networks such as PointMLP and Point Transformer, we find that a large portion of the performance gain is due to improved training strategies, i.e. data augmentation and optimization techniques, and increased model sizes rather than architectural innovations. Thus, the full potential of PointNet++ has yet to be explored. In this work, we revisit the classical PointNet++ through a systematic study of model training and scaling strategies, and offer two major contributions. First, we propose a set of improved training strategies that significantly improve PointNet++ performance. For example, we show that, without any change in architecture, the overall accuracy (OA) of PointNet++ on ScanObjectNN object classification can be raised from 77.9% to 86.1%, even outperforming state-of-the-art PointMLP. Second, we introduce an inverted residual bottleneck design and separable MLPs into PointNet++ to enable efficient and effective model scaling and propose PointNeXt, the next version of PointNets. PointNeXt can be flexibly scaled up and outperforms state-of-the-art methods on both 3D classification and segmentation tasks. For classification, PointNeXt reaches an overall accuracy of 87.7 on ScanObjectNN, surpassing PointMLP by 2.3%, while being 10x faster in inference. For semantic segmentation, PointNeXt establishes a new state-of-the-art performance with 74.9% mean IoU on S3DIS (6-fold cross-validation), being superior to the recent Point Transformer. The code and models are available at https://github.com/guochengqian/pointnext.

updated: Wed Oct 12 2022 19:28:56 GMT+0000 (UTC)

published: Thu Jun 09 2022 17:59:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト