Frame Averaging for Equivariant Shape Space Learning

Matan Atzmon; Koki Nagano; Sanja Fidler; Sameh Khamis; Yaron Lipman

同変形状空間学習のためのフレーム平均化

形状空間学習のタスクには、一連の形状のセットを、優れた一般化プロパティを使用して潜在表現空間との間でマッピングすることが含まれます。多くの場合、実際の形状のコレクションには対称性があります。これは、形状の本質を変えない変換として定義できます。形状空間の学習に対称性を組み込む自然な方法は、形状空間へのマッピング（エンコーダー）と形状空間からのマッピング（デコーダー）が関連する対称性と同等であることを確認することです。この論文では、2つの貢献を紹介することにより、エンコーダーとデコーダーに同変を組み込むためのフレームワークを提示します。（ii）形状のさまざまな部分に適用される区分的ユークリッド運動と同変のオートエンコーダを構築します。私たちの知る限り、これは最初の完全区分的ユークリッド同変オートエンコーダー構造です。フレームワークのトレーニングは簡単です。標準の再構築損失を使用し、新しい損失を導入する必要はありません。私たちのアーキテクチャは、標準（バックボーン）アーキテクチャで構築されており、適切なフレーム平均を使用して同変にします。暗黙のニューラル表現を使用した剛体形状データセットとメッシュベースのニューラルネットワークを使用した関節形状データセットの両方でフレームワークをテストすると、目に見えないテスト形状への最先端の一般化が示され、関連するベースラインが大幅に改善されます。特に、私たちの方法は、目に見えない関節のあるポーズへの一般化において大幅な改善を示しています。

The task of shape space learning involves mapping a train set of shapes to and from a latent representation space with good generalization properties. Often, real-world collections of shapes have symmetries, which can be defined as transformations that do not change the essence of the shape. A natural way to incorporate symmetries in shape space learning is to ask that the mapping to the shape space (encoder) and mapping from the shape space (decoder) are equivariant to the relevant symmetries. In this paper, we present a framework for incorporating equivariance in encoders and decoders by introducing two contributions: (i) adapting the recent Frame Averaging (FA) framework for building generic, efficient, and maximally expressive Equivariant autoencoders; and (ii) constructing autoencoders equivariant to piecewise Euclidean motions applied to different parts of the shape. To the best of our knowledge, this is the first fully piecewise Euclidean equivariant autoencoder construction. Training our framework is simple: it uses standard reconstruction losses and does not require the introduction of new losses. Our architectures are built of standard (backbone) architectures with the appropriate frame averaging to make them equivariant. Testing our framework on both rigid shapes dataset using implicit neural representations, and articulated shape datasets using mesh-based neural networks show state-of-the-art generalization to unseen test shapes, improving relevant baselines by a large margin. In particular, our method demonstrates significant improvement in generalizing to unseen articulated poses.

updated: Fri Dec 03 2021 06:41:19 GMT+0000 (UTC)

published: Fri Dec 03 2021 06:41:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト