Augmenting Imitation Experience via Equivariant Representations

Dhruv Sharma; Alihusein Kuwajerwala; Florian Shkurti

同変表現による模倣体験の強化

模倣によってトレーニングされたビジュアルナビゲーションポリシーの堅牢性は、多くの場合、トレーニングされた画像とアクションのペアの増強に依存します。従来、これは、複数のカメラからデータを収集するか、各画像にランダムノイズを追加するなど、コンピュータービジョンからの標準的なデータ拡張を使用するか、トレーニング画像を合成することによって行われました。この論文では、トレーニングデータで観察されたものの近くの視点の埋め込みとアクションを外挿することに基づいて、ビジュアルナビゲーションのデータ拡張の別の実用的な代替手段があることを示します。私たちの方法は、2Dおよび3Dの視覚的ナビゲーション問題のジオメトリを利用し、画像ではなく、同変埋め込みの関数であるポリシーに依存しています。トレーニングナビゲーションデータセットからの画像とアクションのペアが与えられると、ニューラルネットワークモデルは、同変プロパティを使用して、近くの視点での画像の潜在表現を予測し、データセットを拡張します。次に、拡張されたデータセットに関するポリシーをトレーニングします。私たちのシミュレーション結果は、この方法でトレーニングされたポリシーは、クロストラックエラーが減少し、標準の拡張方法を使用してトレーニングされたポリシーと比較して、必要な介入が少ないことを示しています。また、500m以上の経路に沿った実際の地上ロボットによる自律視覚ナビゲーションでも同様の結果を示しています。

The robustness of visual navigation policies trained through imitation often hinges on the augmentation of the training image-action pairs. Traditionally, this has been done by collecting data from multiple cameras, by using standard data augmentations from computer vision, such as adding random noise to each image, or by synthesizing training images. In this paper we show that there is another practical alternative for data augmentation for visual navigation based on extrapolating viewpoint embeddings and actions nearby the ones observed in the training data. Our method makes use of the geometry of the visual navigation problem in 2D and 3D and relies on policies that are functions of equivariant embeddings, as opposed to images. Given an image-action pair from a training navigation dataset, our neural network model predicts the latent representations of images at nearby viewpoints, using the equivariance property, and augments the dataset. We then train a policy on the augmented dataset. Our simulation results indicate that policies trained in this way exhibit reduced cross-track error, and require fewer interventions compared to policies trained using standard augmentation methods. We also show similar results in autonomous visual navigation by a real ground robot along a path of over 500m.

updated: Thu Oct 14 2021 18:56:08 GMT+0000 (UTC)

published: Thu Oct 14 2021 18:56:08 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト