ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for Mobile Manipulation

Fei Xia; Chengshu Li; Roberto Martín-Martín; Or Litany; Alexander Toshev; Silvio Savarese

Many Reinforcement Learning (RL) approaches use joint control signals (positions, velocities, torques) as the action space for continuous control tasks. We propose to lift the action space to a higher level in the form of subgoals for a motion generator (a combination of a motion planner and a trajectory executor). We argue that, by lifting the action space and by leveraging sampling-based motion planners, we can efficiently use RL to solve complex, long-horizon tasks that could not be solved with existing RL methods in the original action space. We propose ReLMoGen, a framework that combines a learned policy that predicts subgoals with a motion generator that plans and executes the motion needed to reach these subgoals. To validate our method, we apply ReLMoGen to two types of tasks: 1) Interactive Navigation tasks, which are navigation problems where interactions with the environment are required to reach the destination, and 2) Mobile Manipulation tasks, which are manipulation tasks that require moving the robot base. These problems are challenging because they are usually long-horizon, hard to explore during training, and comprise alternating phases of navigation and interaction. Our method is benchmarked on a diverse set of seven robotics tasks in photo-realistic simulation environments. In all settings, ReLMoGen outperforms state-of-the-art Reinforcement Learning and Hierarchical Reinforcement Learning baselines. ReLMoGen also transfers well between different motion generators at test time, indicating strong potential for transfer to real robots.
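The idea of lifting the action space can be illustrated with a small toy sketch. This is not the authors' implementation: the grid world, the names `plan_path`, `SubgoalEnv`, `run_episode`, and `scripted_policy` are all hypothetical, a breadth-first search stands in for the sampling-based motion planner, and a hand-written rule stands in for the learned policy. It only shows the control flow in which each policy action is a subgoal and the motion generator fills in the low-level steps.

```python
from collections import deque
from typing import Callable, List, Optional, Tuple

Cell = Tuple[int, int]

# Hypothetical toy map: '.' is free space, '#' is an obstacle.
GRID = [
    ".....",
    ".###.",
    ".....",
]


def plan_path(grid: List[str], start: Cell, goal: Cell) -> Optional[List[Cell]]:
    """Breadth-first search over free cells.

    Assumption: BFS is used here only as a simple stand-in for a
    sampling-based motion planner. Returns None if the goal is unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    parents = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:  # walk back to the start
                path.append(cell)
                cell = parents[cell]
            return path[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            in_bounds = 0 <= nr < rows and 0 <= nc < cols
            if in_bounds and grid[nr][nc] == "." and nxt not in parents:
                parents[nxt] = cell
                queue.append(nxt)
    return None


class SubgoalEnv:
    """Toy environment whose action is a subgoal cell, not a low-level move."""

    def __init__(self, grid: List[str], start: Cell, target: Cell) -> None:
        self.grid, self.state, self.target = grid, start, target
        self.low_level_steps = 0  # moves executed by the motion generator

    def step(self, subgoal: Cell) -> Tuple[Cell, float, bool]:
        path = plan_path(self.grid, self.state, subgoal)
        if path is not None:  # "execute" the planned trajectory
            self.low_level_steps += len(path) - 1
            self.state = subgoal
        done = self.state == self.target
        return self.state, (1.0 if done else 0.0), done


def scripted_policy(state: Cell) -> Cell:
    """Hand-written stand-in for a learned subgoal policy."""
    return (2, 4) if state == (2, 0) else (0, 4)


def run_episode(env: SubgoalEnv, policy: Callable[[Cell], Cell],
                max_subgoals: int = 10) -> Tuple[int, float]:
    """Roll out the policy; returns (subgoals used, total reward)."""
    total = 0.0
    for n in range(1, max_subgoals + 1):
        _, reward, done = env.step(policy(env.state))
        total += reward
        if done:
            return n, total
    return max_subgoals, total


if __name__ == "__main__":
    env = SubgoalEnv(GRID, start=(2, 0), target=(0, 4))
    subgoals_used, total_reward = run_episode(env, scripted_policy)
    print(subgoals_used, total_reward, env.low_level_steps)
```

In this sketch the policy makes only two decisions while the planner executes six low-level moves, which is the sense in which the lifted action space shortens the horizon the policy has to reason over.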

updated: Fri Mar 26 2021 04:44:22 GMT+0000 (UTC)

published: Tue Aug 18 2020 08:05:15 GMT+0000 (UTC)

arXiv
