No RL, No Simulation: Learning to Navigate without Navigating

Meera Hahn; Devendra Chaplot; Shubham Tulsiani; Mustafa Mukadam; James M. Rehg; Abhinav Gupta

Most prior methods for learning navigation policies require access to simulation environments, as they need online policy interaction and rely on ground-truth maps for rewards. However, building simulators is expensive (it requires manual effort for every scene) and creates challenges in transferring learned policies to real-world robotic platforms, due to the sim-to-real domain gap. In this paper, we pose a simple question: do we really need active interaction, ground-truth maps, or even reinforcement learning (RL) to solve the image-goal navigation task? We propose a self-supervised approach that learns to navigate from only passive videos of roaming. Our approach, No RL, No Simulator (NRNS), is simple and scalable, yet highly effective. NRNS outperforms RL-based formulations by a significant margin. We present NRNS as a strong baseline for any future image-based navigation tasks that use RL or simulation.

updated: Mon Oct 18 2021 17:04:06 GMT+0000 (UTC)

published: Mon Oct 18 2021 17:04:06 GMT+0000 (UTC)

arXiv
