Equivariant Point Network for 3D Point Cloud Analysis

Haiwei Chen; Shichen Liu; Weikai Chen; Hao Li

Features that are equivariant to a larger group of symmetries have been shown in recent studies to be more discriminative and powerful. However, higher-order equivariant features often come with an exponentially growing computational cost. Furthermore, how rotation-equivariant features can be leveraged to tackle 3D shape alignment tasks remains relatively unexplored. While many past approaches have relied on either non-equivariant or invariant descriptors to align 3D shapes, we argue that such tasks may benefit greatly from an equivariant framework. In this paper, we propose an effective and practical SE(3) (3D translation and rotation) equivariant network for point cloud analysis that addresses both problems. First, we present SE(3) separable point convolution, a novel framework that breaks down the 6D convolution into two separable convolutional operators performed alternately in the 3D Euclidean and SO(3) spaces. This significantly reduces the computational cost without compromising performance. Second, we introduce an attention layer to effectively harness the expressiveness of the equivariant features. Trained jointly with the network, the attention layer implicitly derives the intrinsic local frame in the feature space and generates attention vectors that can be integrated into different alignment tasks. We evaluate our approach through extensive studies and visual interpretations. The empirical results demonstrate that our proposed model outperforms strong baselines on a variety of benchmarks.
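The abstract does not spell out the attention layer, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that equivariant features are stored as one feature vector per discretized rotation "anchor", and that rotating the input simply shifts features cyclically along the anchor axis (a simplification; a real discretization of SO(3) would permute anchors in a more complicated way). The names `attention_pool`, `rotate_anchors`, and `weights` are hypothetical. Under these assumptions, softmax attention over the anchors gives a pooled feature that is rotation-invariant, while the attention vector itself moves with the rotation, which is what makes it usable as an alignment cue.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, weights):
    """Pool per-anchor features with softmax attention.

    features: list of A feature vectors (one per rotation anchor), each of length C.
    weights:  hypothetical learned scoring vector of length C (an assumption).
    Returns (attention over the A anchors, pooled feature of length C).
    """
    scores = [sum(w * x for w, x in zip(weights, f)) for f in features]
    attention = softmax(scores)
    pooled = [
        sum(a * f[c] for a, f in zip(attention, features))
        for c in range(len(weights))
    ]
    return attention, pooled


def rotate_anchors(features, shift):
    """Assumed group action: a rotation moves the feature at anchor j to anchor j + shift."""
    n = len(features)
    return [features[(i - shift) % n] for i in range(n)]


random.seed(0)
NUM_ANCHORS, NUM_CHANNELS = 12, 4
features = [
    [random.gauss(0.0, 1.0) for _ in range(NUM_CHANNELS)]
    for _ in range(NUM_ANCHORS)
]
weights = [random.gauss(0.0, 1.0) for _ in range(NUM_CHANNELS)]

attention, pooled = attention_pool(features, weights)
rot_attention, rot_pooled = attention_pool(rotate_anchors(features, 5), weights)

# The pooled feature is unchanged by the rotation (up to rounding),
# while the attention peak moves by the same shift as the input.
peak = attention.index(max(attention))
rot_peak = rot_attention.index(max(rot_attention))
print("pooled feature invariant:",
      all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(pooled, rot_pooled)))
print("attention peak shift:", (rot_peak - peak) % NUM_ANCHORS)
```

In this toy setting, comparing the attention peaks of two inputs recovers the relative shift between them, which loosely mirrors how attention vectors could feed an alignment task.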

updated: Fri Apr 02 2021 10:22:01 GMT+0000 (UTC)

published: Thu Mar 25 2021 21:57:10 GMT+0000 (UTC)

arXiv
