Taking Visual Motion Prediction To New Heightfields

Sebastien Ehrhardt; Aron Monszpart; Niloy Mitra; Andrea Vedaldi

新しい高さフィールドへの視覚運動予測の採用

ニュートン力学の基本法則はよく理解されていますが、物理的なシナリオを説明するには、適切な方程式を使用して問題を手動でモデル化し、関連するパラメーターを推定する必要があります。このような物理関連のコンテキストで人工知能技術の近似機能を活用できるようにするために、研究者は関連する状態を手作りし、シミュレーションの実行をトレーニングデータとして使用して状態遷移を学習するためにニューラルネットワークを使用しました。残念ながら、このようなアプローチは、関連する状態空間を手動で作成するのが面倒で困難な傾向がある複雑な現実世界のシナリオのモデリングには適していません。この作業では、ニューラルネットワークが、不均一な環境を内部的にモデル化し、その過程で長期的な物理的外挿を可能にしながら、視覚データのみに基づいて現実世界の機械的プロセスの物理的状態を暗黙的に学習できるかどうかを調査します。このタスクのためにリカレントニューラルネットワークアーキテクチャを開発し、結果として生じる不確実性を進化する分散推定の形で特徴付けます。さまざまな形状と向きのボウル、および入力として画像のみを使用する任意の高さフィールドでのローリングボールの動きを推定するためのセットアップを評価します。予測の精度とシナリオの複雑さの両方の点で、既存の画像ベースの方法に比べて大幅な改善が報告されています。そして、私たちとは異なり、内部の物理的状態へのアクセスを想定したアプローチで競争力のあるパフォーマンスを報告します。

While the basic laws of Newtonian mechanics are well understood, explaining a physical scenario still requires manually modeling the problem with suitable equations and estimating the associated parameters. In order to be able to leverage the approximation capabilities of artificial intelligence techniques in such physics related contexts, researchers have handcrafted the relevant states, and then used neural networks to learn the state transitions using simulation runs as training data. Unfortunately, such approaches are unsuited for modeling complex real-world scenarios, where manually authoring relevant state spaces tend to be tedious and challenging. In this work, we investigate if neural networks can implicitly learn physical states of real-world mechanical processes only based on visual data while internally modeling non-homogeneous environment and in the process enable long-term physical extrapolation. We develop a recurrent neural network architecture for this task and also characterize resultant uncertainties in the form of evolving variance estimates. We evaluate our setup to extrapolate motion of rolling ball(s) on bowls of varying shape and orientation, and on arbitrary heightfields using only images as input. We report significant improvements over existing image-based methods both in terms of accuracy of predictions and complexity of scenarios; and report competitive performance with approaches that, unlike us, assume access to internal physical states.

updated: Fri Dec 10 2021 19:01:23 GMT+0000 (UTC)

published: Fri Dec 22 2017 13:22:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト