OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning

Qi She; Fan Feng; Xinyue Hao; Qihan Yang; Chuanlin Lan; Vincenzo Lomonaco; Xuesong Shi; Zhengwei Wang; Yao Guo; Yimin Zhang; Fei Qiao; Rosa H. M. Chan

OpenLORIS-Object：生涯にわたる深層学習のためのロボットビジョンデータセットとベンチマーク

コンピュータービジョンの最近のブレークスルーは、トレーニング用の大規模な代表的なデータセット（ImageNetやCOCOなど）の可用性の恩恵を受けています。しかし、ロボットビジョンは、タスクの固定セットに対する非変動分布に対する暗黙の仮定のため、これらの標準的なコンピュータビジョンデータセットから開発された視覚アルゴリズムを適用するためのユニークな課題をもたらします。新しいタスクが利用可能になるたびにモデルを完全に再トレーニングすることは、計算、ストレージ、そして時にはプライバシーの問題のために実行不可能です。一方、ナイーブな増分戦略は壊滅的な忘却に苦しむことが示されています。生涯学習が基本的な能力である適応型視覚知覚システムを使用して、オープンセットで有害な条件下でロボットを継続的に動作させることが重要です。ただし、新しい技術を評価および比較するために利用できるデータセットとベンチマークはほとんどありません。このギャップを埋めるために、RGB-Dカメラを介して収集された新しい生涯のロボットビジョンデータセット（「OpenLORIS-Object」）を提供します。このデータセットには、実際のアプリケーションでロボットが直面する課題が組み込まれており、生涯にわたるオブジェクト認識アルゴリズムを検証するための新しいベンチマークが提供されます。さらに、9つの最新の生涯学習アルゴリズムのテストベッドを提供しています。それぞれには、OpenLORIS-Objectデータセットに対する4つの評価指標を持つ48のタスクが含まれます。結果は、絶えず変化する困難な環境での物体認識タスクは解決にはほど遠いことであり、ボトルネックは前方/後方転送設計にあることを示しています。データセットとベンチマークは、https：//lifelong-robotic-vision.github.io/dataset/objecthttps：//lifelong-robotic-vision.github.io/dataset/objectで公開されています。

The recent breakthroughs in computer vision have benefited from the availability of large representative datasets (e.g. ImageNet and COCO) for training. Yet, robotic vision poses unique challenges for applying visual algorithms developed from these standard computer vision datasets due to their implicit assumption over non-varying distributions for a fixed set of tasks. Fully retraining models each time a new task becomes available is infeasible due to computational, storage and sometimes privacy issues, while naïve incremental strategies have been shown to suffer from catastrophic forgetting. It is crucial for the robots to operate continuously under open-set and detrimental conditions with adaptive visual perceptual systems, where lifelong learning is a fundamental capability. However, very few datasets and benchmarks are available to evaluate and compare emerging techniques. To fill this gap, we provide a new lifelong robotic vision dataset ("OpenLORIS-Object") collected via RGB-D cameras. The dataset embeds the challenges faced by a robot in the real-life application and provides new benchmarks for validating lifelong object recognition algorithms. Moreover, we have provided a testbed of 9 state-of-the-art lifelong learning algorithms. Each of them involves 48 tasks with 4 evaluation metrics over the OpenLORIS-Object dataset. The results demonstrate that the object recognition task in the ever-changing difficulty environments is far from being solved and the bottlenecks are at the forward/backward transfer designs. Our dataset and benchmark are publicly available at at https://lifelong-robotic-vision.github.io/dataset/objecthttps://lifelong-robotic-vision.github.io/dataset/object.

updated: Fri Mar 06 2020 05:31:06 GMT+0000 (UTC)

published: Fri Nov 15 2019 06:27:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト