Canonical Capsules: Self-Supervised Capsules in Canonical Pose

Weiwei Sun; Andrea Tagliasacchi; Boyang Deng; Sara Sabour; Soroosh Yazdani; Geoffrey Hinton; Kwang Moo Yi

We propose a self-supervised capsule architecture for 3D point clouds. We compute capsule decompositions of objects through permutation-equivariant attention, and self-supervise the process by training with pairs of randomly rotated objects. Our key idea is to aggregate the attention masks into semantic keypoints, and use these to supervise a decomposition that satisfies the capsule invariance/equivariance properties. This not only enables the training of a semantically consistent decomposition, but also allows us to learn a canonicalization operation that enables object-centric reasoning. To train our neural network, we require neither classification labels nor manually aligned training datasets. Yet, by learning an object-centric representation in a self-supervised manner, our method outperforms the state of the art on 3D point cloud reconstruction, canonicalization, and unsupervised classification.
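The keypoint aggregation and the equivariance property mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: `radial_attention` is a hypothetical, hand-written stand-in for the learned attention network, made rotation-invariant by construction (the paper instead encourages this property through training on rotated pairs), and the radii, sharpness, and function names are assumptions chosen only for illustration. The sketch shows that attention-weighted keypoints rotate together with the input cloud and do not depend on point order.

```python
import math
import random

def radial_attention(points, radii, sharpness=4.0):
    """Toy per-point attention over K capsules (softmax across capsules).

    Hypothetical stand-in for a learned network: logits depend only on each
    point's distance to the centroid, so they are rotation-invariant, and
    each point is scored independently (permutation-equivariant).
    """
    n = len(points)
    centroid = [sum(p[d] for p in points) / n for d in range(3)]
    attention = []
    for p in points:
        dist = math.dist(p, centroid)
        logits = [-sharpness * (dist - r) ** 2 for r in radii]
        top = max(logits)  # subtract the max for a numerically stable softmax
        exps = [math.exp(v - top) for v in logits]
        total = sum(exps)
        attention.append([e / total for e in exps])
    return attention

def aggregate_keypoints(points, attention):
    """Attention-weighted mean of the points: one 3D keypoint per capsule."""
    num_capsules = len(attention[0])
    keypoints = []
    for j in range(num_capsules):
        mass = sum(a[j] for a in attention)
        keypoints.append([
            sum(a[j] * p[d] for a, p in zip(attention, points)) / mass
            for d in range(3)
        ])
    return keypoints

def rotate_z(points, angle):
    """Rotate a list of 3D points about the z-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c * x - s * y, s * x + c * y, z] for x, y, z in points]

def keypoints_of(points, radii=(0.3, 0.7, 1.1)):
    return aggregate_keypoints(points, radial_attention(points, radii))

def max_abs_diff(a, b):
    return max(abs(x - y) for u, v in zip(a, b) for x, y in zip(u, v))

rng = random.Random(0)
cloud = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(200)]
angle = 1.234

# Equivariance: keypoints of the rotated cloud match the rotated keypoints.
equivariance_error = max_abs_diff(
    keypoints_of(rotate_z(cloud, angle)),
    rotate_z(keypoints_of(cloud), angle),
)

# Point order does not matter: shuffling the cloud leaves keypoints unchanged.
shuffled = cloud[:]
rng.shuffle(shuffled)
permutation_error = max_abs_diff(keypoints_of(shuffled), keypoints_of(cloud))

print(equivariance_error, permutation_error)
```

Both printed errors should be at floating-point round-off level. In this toy setting the invariance of the attention is hard-coded; a learned attention has no such guarantee, which is why a consistency signal between rotated copies of the same object is useful as supervision.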

updated: Wed Nov 24 2021 19:06:50 GMT+0000 (UTC)

published: Tue Dec 08 2020 20:13:28 GMT+0000 (UTC)

arXiv
