Point Transformer

Hengshuang Zhao; Li Jiang; Jiaya Jia; Philip Torr; Vladlen Koltun

ポイントトランスフォーマ

Point Transformer

自己注意ネットワークは自然言語処理に革命をもたらし、画像分類や物体検出などの画像解析タスクにおいて目覚ましい進歩を遂げている。この成功に触発されて、我々は自己注意ネットワークの3次元点群処理への応用を検討している。点群のための自己注意層を設計し、それを使用して、意味的なシーンのセグメンテーション、オブジェクト部分のセグメンテーション、オブジェクトの分類などのタスクのための自己注意ネットワークを構築する。我々のポイントトランスフォーマの設計は、領域やタスクにまたがる先行研究を改善する。例えば、大規模な意味シーンセグメンテーションのための困難なS3DISデータセットでは、ポイントトランスフォーマはエリア5で70.4%のmIoUを達成し、最強の先行モデルを3.3絶対パーセンテージポイント上回り、初めて70%のmIoUのしきい値を超えた。

Self-attention networks have revolutionized natural language processing and are making impressive strides in image analysis tasks such as image classification and object detection. Inspired by this success, we investigate the application of self-attention networks to 3D point cloud processing. We design self-attention layers for point clouds and use these to construct self-attention networks for tasks such as semantic scene segmentation, object part segmentation, and object classification. Our Point Transformer design improves upon prior work across domains and tasks. For example, on the challenging S3DIS dataset for large-scale semantic scene segmentation, the Point Transformer attains an mIoU of 70.4% on Area 5, outperforming the strongest prior model by 3.3 absolute percentage points and crossing the 70% mIoU threshold for the first time.

updated: Wed Dec 16 2020 18:58:56 GMT+0000 (UTC)

published: Wed Dec 16 2020 18:58:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト