Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments

Steven D. Morad; Roberto Mecca; Rudra P. K. Poudel; Stephan Liwicki; Roberto Cipolla

実環境での自動カリキュラム学習による具体化されたビジュアルナビゲーション

ナビゲーションタスクに合わせた自動カリキュラム学習の方法であるNavACLを紹介します。 NavACLはトレーニングが簡単で、幾何学的特徴を使用して関連するタスクを効率的に選択します。私たちの実験では、NavACLを使用してトレーニングされた深層強化学習エージェントは、現在の標準である均一なサンプリングでトレーニングされた最先端のエージェントを大幅に上回っています。さらに、エージェントは、RGB画像のみを使用して、未知の雑然とした屋内環境をナビゲートして、意味的に指定されたターゲットに到達できます。障害物回避ポリシーと凍結機能ネットワークは、変更や再トレーニングの要件なしに、目に見えない現実の環境への転送をサポートします。私たちはシミュレーションで、そして現実の世界で地上ロボットとクワッドロータードローンでポリシーを評価します。実際の結果のビデオは、補足資料で入手できます。

We present NavACL, a method of automatic curriculum learning tailored to the navigation task. NavACL is simple to train and efficiently selects relevant tasks using geometric features. In our experiments, deep reinforcement learning agents trained using NavACL significantly outperform state-of-the-art agents trained with uniform sampling -- the current standard. Furthermore, our agents can navigate through unknown cluttered indoor environments to semantically-specified targets using only RGB images. Obstacle-avoiding policies and frozen feature networks support transfer to unseen real-world environments, without any modification or retraining requirements. We evaluate our policies in simulation, and in the real world on a ground robot and a quadrotor drone. Videos of real-world results are available in the supplementary material.

updated: Wed Jan 06 2021 18:29:42 GMT+0000 (UTC)

published: Fri Sep 11 2020 13:28:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト