Continuous Spatiotemporal Transformers

Antonio H. de O. Fonseca; Emanuele Zappala; Josue Ortega Caro; David van Dijk

Modeling spatiotemporal dynamical systems is a fundamental challenge in machine learning. Transformer models have been very successful in NLP and computer vision, where they provide interpretable representations of data. However, a limitation of transformers in modeling continuous dynamical systems is that they are fundamentally discrete in both time and space, and thus offer no guarantees regarding continuous sampling. To address this challenge, we present the Continuous Spatiotemporal Transformer (CST), a new transformer architecture designed for modeling continuous systems. This framework guarantees a continuous and smooth output via optimization in Sobolev space. We benchmark CST against traditional transformers as well as other spatiotemporal dynamics modeling methods and achieve superior performance on a number of tasks involving synthetic and real systems, including learning brain dynamics from calcium imaging data.
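
The abstract does not spell out the loss used for "optimization in Sobolev space", so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a discrete H^1-style loss on a uniformly sampled 1D signal: the usual mean squared error on values plus a weighted mean squared error on forward-difference derivatives. The names `finite_difference`, `sobolev_h1_loss`, and the `weight` parameter are hypothetical and chosen for illustration.

```python
import math


def finite_difference(values, dt):
    """Forward-difference estimate of the first derivative on a uniform grid."""
    return [(b - a) / dt for a, b in zip(values, values[1:])]


def sobolev_h1_loss(pred, target, dt, weight=1.0):
    """Discrete H^1-style loss (illustrative assumption, not the paper's exact loss).

    Returns the mean squared error on the values plus `weight` times the
    mean squared error on the forward-difference derivatives.
    """
    if len(pred) != len(target) or len(pred) < 2:
        raise ValueError("pred and target must have equal length >= 2")
    value_term = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    d_pred = finite_difference(pred, dt)
    d_target = finite_difference(target, dt)
    deriv_term = sum((p - t) ** 2 for p, t in zip(d_pred, d_target)) / len(d_pred)
    return value_term + weight * deriv_term


# Two predictions with the same pointwise error magnitude against a sine target.
dt = 0.01
times = [i * dt for i in range(200)]
target = [math.sin(t) for t in times]
smooth_pred = [y + 0.05 for y in target]                             # constant offset
jagged_pred = [y + 0.05 * (-1) ** i for i, y in enumerate(target)]   # alternating offset

smooth_loss = sobolev_h1_loss(smooth_pred, target, dt)
jagged_loss = sobolev_h1_loss(jagged_pred, target, dt)
```

Both predictions have the same plain mean squared error, but the derivative term penalizes the jagged one far more heavily. This is the sense in which a Sobolev-type objective favors smooth outputs, under the assumptions stated above.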

updated: Fri Jul 28 2023 21:38:47 GMT+0000 (UTC)

published: Tue Jan 31 2023 00:06:56 GMT+0000 (UTC)

arXiv
