Learning to Map Natural Language Instructions to Physical Quadcopter   Control using Simulated Flight

Valts Blukis; Yannick Terme; Eyvind Niklasson; Ross A. Knepper; Yoav Artzi

シミュレートされたフライトを使用して自然言語命令を物理的なクワッドコプター制御にマップする学習

Learning to Map Natural Language Instructions to Physical Quadcopter Control using Simulated Flight

ナビゲーション指示と生の一人称観測を連続制御にマッピングするための、共同シミュレーションと実世界の学習フレームワークを提案します。私たちのモデルは、環境調査の必要性を推定し、実行中に環境位置を訪問する可能性を予測し、エージェントを制御して、可能性の高い位置を探索および訪問します。教師あり強化非同期学習（SuReAL）を紹介します。学習では、トレーニング中に物理環境で自律飛行を必要とせずにシミュレーション環境と実際の環境の両方を使用し、訪問する位置を予測するための教師あり学習と継続的な制御のための強化学習を組み合わせます。物理的なクワッドコプターを使用した自然言語の指示に従うタスクでのアプローチを評価し、効果的な実行および探索動作を示します。

We propose a joint simulation and real-world learning framework for mapping navigation instructions and raw first-person observations to continuous control. Our model estimates the need for environment exploration, predicts the likelihood of visiting environment positions during execution, and controls the agent to both explore and visit high-likelihood positions. We introduce Supervised Reinforcement Asynchronous Learning (SuReAL). Learning uses both simulation and real environments without requiring autonomous flight in the physical environment during training, and combines supervised learning for predicting positions to visit and reinforcement learning for continuous control. We evaluate our approach on a natural language instruction-following task with a physical quadcopter, and demonstrate effective execution and exploration behavior.

updated: Mon Oct 21 2019 21:19:33 GMT+0000 (UTC)

published: Mon Oct 21 2019 21:19:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト