Multiple View Performers for Shape Completion

David Watkins-Valls; Peter Allen; Krzysztof Choromanski; Jacob Varley; Nicholas Waytowich

形状補完のための複数のビュー実行者

一連の時間的に連続したビューから 3D 形状を完成させるための新しいアーキテクチャである Multiple View Performer (MVP) を提案します。 MVP は、Performers と呼ばれる線形注意トランスフォーマーを使用して、このタスクを実行します。私たちのモデルは、シーンの現在の観測を以前の観測に合わせて、より正確な埋め込みを可能にします。過去の観測履歴は、最新の連続ホップフィールドメモリに近似するコンパクトな連想メモリを介して圧縮されますが、サイズは履歴の長さとはまったく無関係です。 MVP が提供する一般化の利点を実証するために、時間の経過に伴う形状完成のいくつかのベースラインと私たちのモデルを比較します。私たちの知る限りでは、MVP は、複数の深度ビューの登録を必要としない最初の複数ビューボクセル再構成方法であり、3D 形状完成のための最初の因果トランスフォーマーベースのモデルです。

We propose the Multiple View Performer (MVP) - a new architecture for 3D shape completion from a series of temporally sequential views. MVP accomplishes this task by using linear-attention Transformers called Performers. Our model allows the current observation of the scene to attend to the previous ones for more accurate infilling. The history of past observations is compressed via the compact associative memory approximating modern continuous Hopfield memory, but crucially of size independent from the history length. We compare our model with several baselines for shape completion over time, demonstrating the generalization gains that MVP provides. To the best of our knowledge, MVP is the first multiple view voxel reconstruction method that does not require registration of multiple depth views and the first causal Transformer based model for 3D shape completion.

updated: Tue Sep 13 2022 20:45:24 GMT+0000 (UTC)

published: Tue Sep 13 2022 20:45:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト