Predictive World Models from Real-World Partial Observations

Robin Karlsson; Alexander Carballo; Keisuke Fujii; Kento Ohtani; Kazuya Takeda

実世界の部分観測からの予測世界モデル

認知科学者は、人間のような適応可能な知的エージェントは、エージェントと環境の学習された因果的精神的シミュレーションを通じて推論を行うと考えています。このようなシミュレーションを学習する問題は、予測世界モデリングと呼ばれます。最近、世界モデルを活用する強化学習 (RL) エージェントが、ゲーム環境で SOTA パフォーマンスを達成しました。ただし、モバイルロボットに関連する複雑な現実世界の環境でワールドモデリングアプローチを適用する方法を理解することは未解決の問題です。この論文では、現実世界の道路環境の確率的予測世界モデルを学習するためのフレームワークを提示します。蓄積されたセンサー観測から、完全に観測されたもっともらしい世界の多様なセットを予測できる階層型 VAE (HVAE) を使用してモデルを実装します。以前の HVAE メソッドは、学習のグラウンドトゥルースとして完全な状態を必要としますが、HVAE が部分的に観測された状態のみから完全な状態を予測することを学習できるようにする新しいシーケンシャルトレーニングメソッドを提示します。決定論的領域の正確な空間構造予測を実験的に実証し、96.21 IoU を達成し、最良の予測を使用して、確率的領域の完全な予測までのギャップを 62% 縮めました。完全なグラウンドトゥルース状態が存在しない場合に HVAE を拡張することで、実世界のモバイルロボットアプリケーションの説明可能で包括的な予測世界モデルを実現するためのステップとして、空間予測の継続的な学習を促進します。コードは https://github.com/robin-karlsson0/predictive-world-models で入手できます。

Cognitive scientists believe adaptable intelligent agents like humans perform reasoning through learned causal mental simulations of agents and environments. The problem of learning such simulations is called predictive world modeling. Recently, reinforcement learning (RL) agents leveraging world models have achieved SOTA performance in game environments. However, understanding how to apply the world modeling approach in complex real-world environments relevant to mobile robots remains an open question. In this paper, we present a framework for learning a probabilistic predictive world model for real-world road environments. We implement the model using a hierarchical VAE (HVAE) capable of predicting a diverse set of fully observed plausible worlds from accumulated sensor observations. While prior HVAE methods require complete states as ground truth for learning, we present a novel sequential training method to allow HVAEs to learn to predict complete states from partially observed states only. We experimentally demonstrate accurate spatial structure prediction of deterministic regions achieving 96.21 IoU, and close the gap to perfect prediction by 62% for stochastic regions using the best prediction. By extending HVAEs to cases where complete ground truth states do not exist, we facilitate continual learning of spatial prediction as a step towards realizing explainable and comprehensive predictive world models for real-world mobile robotics applications. Code is available at https://github.com/robin-karlsson0/predictive-world-models.

updated: Wed Apr 26 2023 02:20:02 GMT+0000 (UTC)

published: Thu Jan 12 2023 02:07:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト