Classification of Single-View Object Point Clouds

Zelin Xu; Ke Chen; Kangjun Liu; Changxing Ding; Yaowei Wang; Kui Jia

Object point cloud classification has drawn great research attention since the release of benchmark datasets such as ModelNet and ShapeNet. These benchmarks assume point clouds covering the complete surfaces of object instances, for which many high-performing methods have been developed. However, this setting deviates from what is often met in practice, where, due to (self-)occlusion, a point cloud covering only a partial surface of an object is captured from an arbitrary view. We show in this paper that the performance of existing point cloud classifiers drops drastically under this single-view, partial setting; the phenomenon is consistent with the observation that the semantic category of a partial object surface is less ambiguous only when its distribution on the whole surface is clearly specified. To this end, we argue for a single-view, partial setting in which supervised learning of object pose estimation accompanies classification. Technically, we propose a baseline method, the Pose-Accompanied Point cloud classification Network (PAPNet); built upon SE(3)-equivariant convolutions, PAPNet learns intermediate pose transformations for equivariant features defined on vector fields, which (ideally) makes the subsequent classification easier in the category-level canonical pose. By adapting the existing ModelNet40 and ScanNet datasets to the single-view, partial setting, experimental results verify the necessity of object pose estimation and the superiority of PAPNet over existing classifiers.
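The single-view, partial setting described in the abstract can be illustrated with a small sketch. This is not the authors' data pipeline: it assumes an orthographic camera looking down the z-axis and uses a coarse depth buffer as a stand-in for real occlusion reasoning. The function names (`sample_sphere`, `random_rotation`, `single_view_partial`) and the `cell` parameter are hypothetical and chosen only for this example.

```python
import math
import random


def sample_sphere(n, rng):
    """Sample n points uniformly on the unit sphere (a toy 'complete' object)."""
    pts = []
    for _ in range(n):
        z = rng.uniform(-1.0, 1.0)
        phi = rng.uniform(0.0, 2.0 * math.pi)
        r = math.sqrt(1.0 - z * z)
        pts.append((r * math.cos(phi), r * math.sin(phi), z))
    return pts


def random_rotation(rng):
    """Return a 3x3 rotation matrix built from a uniformly random unit quaternion."""
    u1, u2, u3 = rng.random(), rng.random(), rng.random()
    a, b = math.sqrt(1.0 - u1), math.sqrt(u1)
    x = a * math.sin(2.0 * math.pi * u2)
    y = a * math.cos(2.0 * math.pi * u2)
    z = b * math.sin(2.0 * math.pi * u3)
    w = b * math.cos(2.0 * math.pi * u3)
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - z * w), 2 * (x * z + y * w)],
        [2 * (x * y + z * w), 1 - 2 * (x * x + z * z), 2 * (y * z - x * w)],
        [2 * (x * z - y * w), 2 * (y * z + x * w), 1 - 2 * (x * x + y * y)],
    ]


def rotate(points, R):
    """Apply rotation matrix R to every point (arbitrary object pose)."""
    return [
        tuple(sum(R[i][j] * p[j] for j in range(3)) for i in range(3))
        for p in points
    ]


def single_view_partial(points, cell=0.2):
    """Approximate a single-view scan with an orthographic depth buffer.

    Assumption: the camera sits at z = +infinity and looks along -z, so in
    each (x, y) grid cell only the point with the largest z is visible.
    """
    nearest = {}
    for p in points:
        key = (math.floor(p[0] / cell), math.floor(p[1] / cell))
        if key not in nearest or p[2] > nearest[key][2]:
            nearest[key] = p
    return list(nearest.values())


rng = random.Random(0)
full = sample_sphere(2000, rng)          # complete surface, canonical pose
R = random_rotation(rng)                 # arbitrary, unknown pose
partial = single_view_partial(rotate(full, R))  # what a classifier would see
print(len(full), len(partial))
```

A classifier in this setting receives only `partial`, while the rotation `R` plays the role of the pose label that the abstract argues should be supervised alongside the category. The depth-buffer approximation is coarse: with sparse points or a small `cell`, some back-facing points can survive.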

updated: Sun Feb 27 2022 11:40:09 GMT+0000 (UTC)

published: Fri Dec 18 2020 04:00:56 GMT+0000 (UTC)

arXiv

