Learning from Demonstration with Weakly Supervised Disentanglement

Yordan Hristov; Subramanian Ramamoorthy

弱く教師あり解きほぐしによるデモンストレーションからの学習

柔らかいスポンジで拭くなどのロボット操作タスクには、複数の豊富な感覚モダリティからの制御が必要です。ロボットを教えることを目的とした人間とロボットの相互作用は、豊富なデータストリームの人間と機械の理解の間に不一致が生じる可能性があるため、この設定では困難です。デモンストレーションからの解釈可能な学習のタスクを、確率的生成モデルに対する最適化問題として扱います。データの高次元性を説明するために、モデルを表すために大容量のニューラルネットワークが選択されます。このモデルの潜在変数は、一連のデモンストレーションで明示されている高レベルの概念および概念と明示的に整合しています。このような配置は、潜在変数よりも優先順位を選択する設計者の従来のアプローチとは対照的に、適切に制限された語彙でエンドユーザーからのラベルを使用することによって最もよく達成されることを示します。私たちのアプローチは、PR2ロボットによって実行される2つの卓上ロボット操作タスクのコンテキストで評価されます-スポンジで液体を軽くたたく（スポンジを強く押して表面に沿って動かす）ことと、異なる容器の間に注ぐことです。ロボットは、視覚情報、腕の関節の位置、腕の関節の動きを提供します。タスクとデータのビデオを利用できるようにしました。https：//sites.google.com/view/weak-label-lfdの補足資料を参照してください。

Robotic manipulation tasks, such as wiping with a soft sponge, require control from multiple rich sensory modalities. Human-robot interaction, aimed at teaching robots, is difficult in this setting as there is potential for mismatch between human and machine comprehension of the rich data streams. We treat the task of interpretable learning from demonstration as an optimisation problem over a probabilistic generative model. To account for the high-dimensionality of the data, a high-capacity neural network is chosen to represent the model. The latent variables in this model are explicitly aligned with high-level notions and concepts that are manifested in a set of demonstrations. We show that such alignment is best achieved through the use of labels from the end user, in an appropriately restricted vocabulary, in contrast to the conventional approach of the designer picking a prior over the latent variables. Our approach is evaluated in the context of two table-top robot manipulation tasks performed by a PR2 robot -- that of dabbing liquids with a sponge (forcefully pressing a sponge and moving it along a surface) and pouring between different containers. The robot provides visual information, arm joint positions and arm joint efforts. We have made videos of the tasks and data available - see supplementary materials at: https://sites.google.com/view/weak-label-lfd.

updated: Fri Mar 26 2021 12:15:52 GMT+0000 (UTC)

published: Tue Jun 16 2020 12:29:51 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト