Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud

Mutian Xu; Junhao Zhang; Zhipeng Zhou; Mingye Xu; Xiaojuan Qi; Yu Qiao

In 2D image processing, some attempts decompose images into high- and low-frequency components to describe edge and smooth parts, respectively. Similarly, the contour and flat areas of 3D objects, such as the boundary and seat area of a chair, describe different but complementary geometries. However, this distinction is lost in previous deep networks, which understand point clouds by treating all points or local patches equally. To solve this problem, we propose the Geometry-Disentangled Attention Network (GDANet). GDANet introduces a Geometry-Disentangle Module that dynamically disentangles point clouds into the contour and flat parts of 3D objects, denoted by sharp and gentle variation components, respectively. GDANet then exploits a Sharp-Gentle Complementary Attention Module that regards the features from the sharp and gentle variation components as two holistic representations and pays different attention to each while fusing them with the original point cloud features. In this way, our method captures and refines holistic and complementary 3D geometric semantics from two distinct disentangled components to supplement the local information. Extensive experiments on 3D object classification and segmentation benchmarks demonstrate that GDANet achieves state-of-the-art results with fewer parameters. Code is released at https://github.com/mutianxu/GDANet.
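To make the idea of splitting a point cloud into sharp and gentle variation components concrete, here is a minimal sketch in plain Python. It is not the authors' implementation, and the abstract does not say how variation is measured. As an assumption for illustration, each point is scored by its distance from the centroid of its k nearest neighbours; the m highest-scoring points are treated as the sharp component and the m lowest-scoring points as the gentle component. The function names and the parameters `k` and `m` are hypothetical.

```python
import math


def knn_indices(points, k):
    """Return, for each point, the indices of its k nearest neighbours (itself excluded)."""
    neighbours = []
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: math.dist(p, points[j]))
        neighbours.append(others[:k])
    return neighbours


def variation_scores(points, k):
    """Assumed variation measure: distance from a point to the centroid of its neighbours."""
    scores = []
    for p, idx in zip(points, knn_indices(points, k)):
        centroid = [sum(points[j][d] for j in idx) / len(idx) for d in range(len(p))]
        scores.append(math.dist(p, centroid))
    return scores


def disentangle(points, k, m):
    """Split point indices into a sharp set (m highest scores) and a gentle set (m lowest)."""
    scores = variation_scores(points, k)
    order = sorted(range(len(points)), key=scores.__getitem__)  # ascending by score
    gentle = order[:m]
    sharp = order[-m:]
    return sharp, gentle


# Toy input: a flat 5 x 5 grid in the plane z = 0 plus one spike above its centre.
points = [(float(x), float(y), 0.0) for x in range(5) for y in range(5)]
points.append((2.0, 2.0, 3.0))
spike_index = len(points) - 1

sharp, gentle = disentangle(points, k=4, m=3)
print("sharp indices:", sharp)
print("gentle indices:", gentle)
```

On this toy input the spike receives the largest score, while the gentle set is drawn from interior grid points whose neighbours surround them symmetrically. The brute-force neighbour search is quadratic in the number of points, which is acceptable for a demonstration but not for real point clouds.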

updated: Sun Feb 07 2021 06:45:10 GMT+0000 (UTC)

published: Sun Dec 20 2020 13:35:00 GMT+0000 (UTC)

arXiv
