Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization

Chieko Sarah Imai; Minghao Zhang; Yuchen Zhang; Marcin Kierebinski; Ruihan Yang; Yuzhe Qin; Xiaolong Wang

マルチモーダル遅延ランダム化による野生での視覚誘導四足歩行運動

さまざまな障害物、動的な環境、起伏のある地形など、複雑な環境で4ペダルロボット用の堅牢な視覚誘導コントローラーを開発することは非常に困難です。強化学習（RL）は、シミュレーションでのビジョン入力を使用したアジャイル移動スキルの有望なパラダイムを提供しますが、RLポリシーを現実の世界に展開することは依然として非常に困難です。私たちの重要な洞察は、ドメインギャップの不一致は別として、シミュレーションと実世界の間の視覚的な外観において、制御パイプラインからの遅延も問題の主な原因であるということです。この論文では、RLエージェントをトレーニングするときにこの問題に対処するためにマルチモーダル遅延ランダム化（MMDR）を提案します。具体的には、固有受容感覚と視覚の両方について、ランダム化された期間でサンプリングされた過去の観測を使用して、実際のハードウェアの遅延をシミュレートします。事前定義されたコントローラーや参照モーションを使用せずに、物理シミュレーターでエンドツーエンド制御のRLポリシーをトレーニングし、実際に稼働している実際のA1四重ロボットに直接展開します。複雑な地形や障害物があるさまざまな屋外環境でメソッドを評価します。ロボットが高速でスムーズに操縦し、障害物を回避し、ベースラインを大幅に上回っていることを示します。ビデオ付きのプロジェクトページはhttps://mehooz.github.io/mmdr-wild/です。

Developing robust vision-guided controllers for quadrupedal robots in complex environments, with various obstacles, dynamical surroundings and uneven terrains, is very challenging. While Reinforcement Learning (RL) provides a promising paradigm for agile locomotion skills with vision inputs in simulation, it is still very challenging to deploy the RL policy in the real world. Our key insight is that aside from the discrepancy in the domain gap, in visual appearance between the simulation and the real world, the latency from the control pipeline is also a major cause of difficulty. In this paper, we propose Multi-Modal Delay Randomization (MMDR) to address this issue when training RL agents. Specifically, we simulate the latency of real hardware by using past observations, sampled with randomized periods, for both proprioception and vision. We train the RL policy for end-to-end control in a physical simulator without any predefined controller or reference motion, and directly deploy it on the real A1 quadruped robot running in the wild. We evaluate our method in different outdoor environments with complex terrains and obstacles. We demonstrate the robot can smoothly maneuver at a high speed, avoid the obstacles, and show significant improvement over the baselines. Our project page with videos is at https://mehooz.github.io/mmdr-wild/.

updated: Sun Jul 24 2022 01:55:19 GMT+0000 (UTC)

published: Wed Sep 29 2021 16:48:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト