SLAMP: Stochastic Latent Appearance and Motion Prediction

Adil Kaan Akan; Erkut Erdem; Aykut Erdem; Fatma Güney

SLAMP：確率的潜在的外観と動き予測

モーションはビデオ予測の重要な手がかりであり、多くの場合、ビデオコンテンツを静的コンポーネントと動的コンポーネントに分離することで利用されます。モーションを利用したこれまでの作業のほとんどは決定論的ですが、将来の固有の不確実性をモデル化できる確率論的方法があります。既存の確率モデルは、運動について明示的に推論しないか、静的部分について限定的な仮定をします。本論文では、動きの履歴に基づいて未来を予測することにより、ビデオの外観と動きについて確率的に推論します。履歴のない動きについての明示的な推論は、すでに現在の確率モデルのパフォーマンスに達しています。モーション履歴は、数フレーム先の一貫したダイナミクスを予測できるようにすることで、結果をさらに改善します。私たちのモデルは、一般的なビデオ予測データセットの最先端モデルと同等のパフォーマンスを発揮しますが、複雑な動きと動的な背景を持つ2つの挑戦的な現実世界の自動運転データセットのモデルを大幅に上回っています。

Motion is an important cue for video prediction and often utilized by separating video content into static and dynamic components. Most of the previous work utilizing motion is deterministic but there are stochastic methods that can model the inherent uncertainty of the future. Existing stochastic models either do not reason about motion explicitly or make limiting assumptions about the static part. In this paper, we reason about appearance and motion in the video stochastically by predicting the future based on the motion history. Explicit reasoning about motion without history already reaches the performance of current stochastic models. The motion history further improves the results by allowing to predict consistent dynamics several frames into the future. Our model performs comparably to the state-of-the-art models on the generic video prediction datasets, however, significantly outperforms them on two challenging real-world autonomous driving datasets with complex motion and dynamic background.

updated: Thu Aug 05 2021 17:52:18 GMT+0000 (UTC)

published: Thu Aug 05 2021 17:52:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト