MoVie: Visual Model-Based Policy Adaptation for View Generalization

Sizhe Yang; Yanjie Ze; Huazhe Xu

Visual reinforcement learning (RL) agents trained on a limited set of views struggle to generalize their learned abilities to unseen views, a difficulty known as the problem of view generalization. In this work, we systematically categorize this problem into four distinct and highly challenging scenarios that closely resemble real-world situations. We then propose a straightforward yet effective approach that enables visual Model-based policies to adapt for View generalization (MoVie) at test time, without requiring explicit reward signals or any modification at training time. Our method yields substantial improvements across all four scenarios, covering 18 tasks in total drawn from DMControl, xArm, and Adroit, with relative improvements of 33%, 86%, and 152%, respectively. These results highlight the potential of our approach for real-world robotics applications. Videos are available at https://yangsizhe.github.io/MoVie/.

updated: Wed Sep 27 2023 09:27:14 GMT+0000 (UTC)

published: Mon Jul 03 2023 12:44:07 GMT+0000 (UTC)

arXiv
