Double-Dot Network for Antipodal Grasp Detection

Yao Wang; Yangtao Zheng; Boyang Gao; Di Huang

対蹠把持検出用のダブルドットネットワーク

この論文では、Double-Dot Network（DD-Net）と呼ばれる、対蹠地把持検出への新しい深層学習アプローチを提案します。これは、経験的に事前設定されたアンカーに依存しない最近のアンカーフリーオブジェクト検出フレームワークに従い、したがって、見えないオブジェクトのより一般化された柔軟な予測を可能にします。具体的には、広く使用されている5次元の長方形とは異なり、グリッパー構成は1対の指先として定義されます。このような指先をローカライズするために効果的なCNNアーキテクチャが導入され、改良のための補助センターの助けを借りて、把握候補者を正確かつ確実に推測します。さらに、把持の品質を測定するための特殊な損失関数を設計し、オブジェクト検出で採用されたバウンディングボックスのIoUスコアとは対照的に、把持検出タスクとの整合性が高くなっています。シミュレーションとロボット実験の両方が実行され、最先端の精度が達成され、DD-Netが目に見えないオブジェクトの処理において対応するものより優れていることを示しています。

This paper proposes a new deep learning approach to antipodal grasp detection, named Double-Dot Network (DD-Net). It follows the recent anchor-free object detection framework, which does not depend on empirically pre-set anchors and thus allows more generalized and flexible prediction on unseen objects. Specifically, unlike the widely used 5-dimensional rectangle, the gripper configuration is defined as a pair of fingertips. An effective CNN architecture is introduced to localize such fingertips, and with the help of auxiliary centers for refinement, it accurately and robustly infers grasp candidates. Additionally, we design a specialized loss function to measure the quality of grasps, and in contrast to the IoU scores of bounding boxes adopted in object detection, it is more consistent to the grasp detection task. Both the simulation and robotic experiments are executed and state of the art accuracies are achieved, showing that DD-Net is superior to the counterparts in handling unseen objects.

updated: Tue Aug 03 2021 14:21:17 GMT+0000 (UTC)

published: Tue Aug 03 2021 14:21:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト