Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?

Ziwei Zhang; Xin Wang; Zeyang Zhang; Peng Cui; Wenwu Zhu

Geometric deep learning, i.e., designing neural networks to handle ubiquitous geometric data such as point clouds and graphs, has achieved great success in the last decade. One critical inductive bias is that the model should remain invariant to various transformations such as translation, rotation, and scaling. Existing graph neural network (GNN) approaches maintain only permutation invariance and cannot guarantee invariance with respect to other transformations. Besides GNNs, other works design sophisticated transformation-invariant layers, which are computationally expensive and difficult to extend. To solve this problem, we revisit why existing neural networks cannot maintain transformation invariance when handling geometric data. Our findings show that transformation-invariant and distance-preserving initial representations are sufficient to achieve transformation invariance, without requiring sophisticated neural layer designs. Motivated by these findings, we propose Transformation Invariant Neural Networks (TinvNN), a straightforward and general framework for geometric data. Specifically, we realize transformation-invariant and distance-preserving initial point representations by modifying multi-dimensional scaling before feeding the representations into neural networks. We prove that TinvNN strictly guarantees transformation invariance and is general and flexible enough to be combined with existing neural networks. Extensive experimental results on point cloud analysis and combinatorial optimization demonstrate the effectiveness and general applicability of our proposed method. Based on the experimental results, we advocate that TinvNN should be considered a new starting point and an essential baseline for further studies of transformation-invariant geometric deep learning.
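The abstract does not spell out how multi-dimensional scaling is modified, so the following is only a minimal sketch of the general idea, not the authors' TinvNN implementation. It assumes classical (Torgerson) MDS on the pairwise distance matrix, division by the mean pairwise distance to remove scale, and a simple sign convention to resolve the eigenvector sign ambiguity. The function name `invariant_initial_representation`, the parameter `k`, and both normalization choices are hypothetical and chosen for illustration.

```python
import numpy as np


def invariant_initial_representation(points, k=3):
    """Sketch: classical MDS features that depend only on pairwise distances.

    Assumptions (not taken from the paper): scale is removed by dividing by
    the mean pairwise distance, and each output axis is sign-flipped so that
    its largest-magnitude entry is positive.
    """
    points = np.asarray(points, dtype=float)
    n = points.shape[0]

    # Pairwise Euclidean distances: unchanged by translation and rotation.
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # Divide by the mean off-diagonal distance to remove uniform scaling.
    dist = dist / (dist.sum() / (n * (n - 1)))

    # Classical MDS: double-center the squared distances to get a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering

    # Keep the k largest eigenpairs (eigh returns ascending eigenvalues).
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:k]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    emb = eigvecs[:, order] * scale

    # Eigenvectors are defined only up to sign; fix a convention per axis.
    cols = np.arange(emb.shape[1])
    signs = np.sign(emb[np.argmax(np.abs(emb), axis=0), cols])
    signs[signs == 0] = 1.0
    return emb * signs
```

Under these assumptions, applying the function to a point cloud and to a translated, rotated, and uniformly scaled copy of it should give the same features up to floating-point error, and those features could then be fed to any standard point-based network. The sign convention can fail when eigenvalues repeat or when two entries tie for the largest magnitude, which is one reason a more careful modification of MDS is needed in general.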

updated: Thu Dec 23 2021 03:52:33 GMT+0000 (UTC)

published: Thu Dec 23 2021 03:52:33 GMT+0000 (UTC)

arXiv
