Reinforcement Learning for Picking Cluttered General Objects with Dense Object Descriptors

Hoang-Giang Cao; Weihao Zeng; I-Chen Wu

Picking cluttered general objects is a challenging task due to their complex geometries and varied stacking configurations. Many prior works rely on pose estimation for picking, but pose estimation is difficult for cluttered objects. In this paper, we propose Cluttered Objects Descriptors (CODs), a dense descriptor for cluttered objects that can represent rich object structures, and use the pre-trained CODs network along with its intermediate outputs to train a picking policy. Additionally, we train the policy with reinforcement learning, which enables the policy to learn picking without supervision. We conduct experiments to demonstrate that CODs consistently represent seen and unseen cluttered objects, which allows the picking policy to robustly pick cluttered general objects. The resulting policy can pick 96.69% of unseen objects in our experimental environment, which is twice as cluttered as the training scenarios.
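As a rough illustration of what a consistent dense descriptor means in practice: a dense descriptor assigns every pixel a vector, and the same physical point should receive nearby vectors in different views. The sketch below is not the authors' implementation. It is a minimal nearest-neighbor lookup over toy descriptor maps, where the function name `best_match`, the nested-list layout, and all numeric values are assumptions made for illustration only.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def best_match(query, descriptor_map):
    """Return (row, col, distance) of the pixel whose descriptor is closest to `query`.

    `descriptor_map` is a hypothetical H x W grid of D-dimensional descriptors
    stored as nested lists; a real network would emit a dense tensor instead.
    """
    best = None
    for r, row in enumerate(descriptor_map):
        for c, desc in enumerate(row):
            d = dist(query, desc)
            if best is None or d < best[2]:
                best = (r, c, d)
    return best


# Toy 2 x 2 descriptor maps for two views of the same scene (made-up values).
view_a = [[(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)],
          [(0.0, 1.0, 0.0), (0.5, 0.5, 0.0)]]
view_b = [[(0.9, 0.1, 0.0), (0.0, 0.1, 0.9)],
          [(0.5, 0.4, 0.1), (0.1, 0.9, 0.0)]]

# Take the descriptor at pixel (0, 1) in view A and look it up in view B.
row, col, distance = best_match(view_a[0][1], view_b)
```

If the descriptors are consistent across views, the matched pixel in the second view corresponds to the same object point, and the returned distance stays small. How the paper's picking policy actually consumes the descriptor and intermediate outputs is not specified in the abstract, so this sketch covers only the matching idea.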

updated: Thu Apr 20 2023 06:24:33 GMT+0000 (UTC)

published: Thu Apr 20 2023 06:24:33 GMT+0000 (UTC)

arXiv
