Visionary: Vision architecture discovery for robot learning

Iretiayo Akinola; Anelia Angelova; Yao Lu; Yevgen Chebotar; Dmitry Kalashnikov; Jacob Varley; Julian Ibarz; Michael S. Ryoo

ビジョナリー：ロボット学習のためのビジョンアーキテクチャの発見

低次元のアクション入力と高次元の視覚入力の間の相互作用を発見する、ロボット操作学習のための視覚ベースのアーキテクチャ検索アルゴリズムを提案します。私たちのアプローチは、タスクのトレーニング中にアーキテクチャを自動的に設計します。つまり、画像の特徴表現をアクションや前のレイヤーの特徴と組み合わせて参加する新しい方法を発見します。得られた新しいアーキテクチャは、最近の高性能ベースラインと比較して、場合によっては大きなマージンで、より良いタスク成功率を示しています。実際のロボット実験でも、把持性能が6％向上することが確認されています。これは、実際のロボットタスクのニューラルアーキテクチャ検索と注意接続検索の成功を実証する最初のアプローチです。

We propose a vision-based architecture search algorithm for robot manipulation learning, which discovers interactions between low dimension action inputs and high dimensional visual inputs. Our approach automatically designs architectures while training on the task - discovering novel ways of combining and attending image feature representations with actions as well as features from previous layers. The obtained new architectures demonstrate better task success rates, in some cases with a large margin, compared to a recent high performing baseline. Our real robot experiments also confirm that it improves grasping performance by 6%. This is the first approach to demonstrate a successful neural architecture search and attention connectivity search for a real-robot task.

updated: Fri Mar 26 2021 17:51:43 GMT+0000 (UTC)

published: Fri Mar 26 2021 17:51:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト