NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations

Qiaoyun Wu; Dinesh Manocha; Jun Wang; Kai Xu

We propose improving the cross-target and cross-scene generalization of visual navigation by learning an agent that is guided by conceiving the next observations it expects to see. This is achieved by learning a variational Bayesian model, called NeoNav, which generates the next expected observations (NEO) conditioned on the agent's current observations and the target view. Our generative model is learned by optimizing a variational objective that encompasses two key designs. First, the latent distribution is conditioned on the current observations and the target view, leading to model-based, target-driven navigation. Second, the latent space is modeled with a Mixture of Gaussians conditioned on the current observation and the next best action. Our use of a mixture-of-posteriors prior effectively alleviates the problem of an over-regularized latent space, thus significantly boosting the model's generalization to new targets and novel scenes. Moreover, the NEO generation models the forward dynamics of agent-environment interaction, which improves the quality of approximate inference and hence benefits data efficiency. We have conducted extensive evaluations on both real-world and synthetic benchmarks, and show that our model consistently outperforms state-of-the-art models in terms of success rate, data efficiency, and generalization.
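To make the role of a mixture prior more concrete, the following is a minimal sketch of how the KL term of a variational objective might be estimated when the prior is a Mixture of Gaussians rather than a single standard Gaussian. It is not the authors' implementation: the function names, the equal mixture weights, the diagonal covariances, and the Monte Carlo estimator are all assumptions chosen for illustration, and the abstract does not specify how NeoNav computes this term.

```python
import math
import random


def log_normal_diag(z, mean, std):
    """Log density of a diagonal Gaussian evaluated at the point z."""
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(s) - 0.5 * ((zi - m) / s) ** 2
        for zi, m, s in zip(z, mean, std)
    )


def log_mixture(z, components):
    """Log density of an equally weighted mixture of diagonal Gaussians.

    `components` is a list of (mean, std) pairs (hypothetical layout).
    """
    logs = [log_normal_diag(z, m, s) for m, s in components]
    top = max(logs)
    # log-sum-exp keeps the computation numerically stable
    return top + math.log(sum(math.exp(l - top) for l in logs)) - math.log(len(logs))


def kl_to_mixture(q_mean, q_std, components, n_samples=2000, seed=0):
    """Monte Carlo estimate of KL(q || mixture), where q is a diagonal Gaussian."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        # Draw a latent sample from the approximate posterior q
        z = [rng.gauss(m, s) for m, s in zip(q_mean, q_std)]
        total += log_normal_diag(z, q_mean, q_std) - log_mixture(z, components)
    return total / n_samples
```

In this sketch, when the posterior `q` coincides with one of the `K` mixture components, the estimate cannot exceed `log K`. This illustrates why a prior built from posterior-like components penalizes the latent code less severely than a single fixed Gaussian would.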

updated: Mon Jan 10 2022 04:10:11 GMT+0000 (UTC)

published: Mon Jun 17 2019 18:14:03 GMT+0000 (UTC)

arXiv
