Fully Online Meta-Learning Without Task Boundaries

Jathushan Rajasegaran; Chelsea Finn; Sergey Levine

タスク境界のない完全なオンラインメタ学習

ディープネットワークは分類器、検出器、トラッカーなどの複雑な機能を学習できますが、多くのアプリケーションでは、入力分布の変化、タスクの変化、環境条件の変化に継続的に適応するモデルが必要です。実際、知識を継続的に蓄積し、過去の経験を使用して継続的な設定で新しいタスクをすばやく学習するこの機能は、インテリジェントシステムの重要な特性の1つです。複雑で高次元の問題の場合、最急降下法などの標準的な学習アルゴリズムを使用してモデルを継続的に更新するだけでは、適応が遅くなる可能性があります。メタ学習は、適応を加速するための強力なツールを提供できますが、従来はバッチ設定で研究されていました。このホワイトペーパーでは、メタ学習を適用してこの種のオンライン問題に取り組む方法を研究し、同時に変化するタスクと入力分布に適応し、将来より迅速に適応するためにモデルをメタトレーニングします。メタ学習をオンライン設定に拡張することには独自の課題があり、いくつかの以前の方法で関連する問題を研究しましたが、一般に、既知のグラウンドトゥルースタスク境界を持つタスクの個別の概念が必要です。このような方法は通常、タスク間で継続的に適応するのではなく、タスク間でモデルをリセットして、各タスクに順番に適応します。多くの実際の設定では、このような個別の境界は使用できず、存在しない場合もあります。これらの設定に対処するために、完全オンラインメタ学習（FOML）アルゴリズムを提案します。これは、タスクの境界に関するグラウンドトゥルースの知識を必要とせず、事前にトレーニングされた重みにリセットせずに完全にオンラインのままです。私たちの実験は、FOMLがRainbow-MNIST、CIFAR100、およびCELEBAデータセットの最先端のオンライン学習方法よりも速く新しいタスクを学習できたことを示しています。

While deep networks can learn complex functions such as classifiers, detectors, and trackers, many applications require models that continually adapt to changing input distributions, changing tasks, and changing environmental conditions. Indeed, this ability to continuously accrue knowledge and use past experience to learn new tasks quickly in continual settings is one of the key properties of an intelligent system. For complex and high-dimensional problems, simply updating the model continually with standard learning algorithms such as gradient descent may result in slow adaptation. Meta-learning can provide a powerful tool to accelerate adaptation yet is conventionally studied in batch settings. In this paper, we study how meta-learning can be applied to tackle online problems of this nature, simultaneously adapting to changing tasks and input distributions and meta-training the model in order to adapt more quickly in the future. Extending meta-learning into the online setting presents its own challenges, and although several prior methods have studied related problems, they generally require a discrete notion of tasks, with known ground-truth task boundaries. Such methods typically adapt to each task in sequence, resetting the model between tasks, rather than adapting continuously across tasks. In many real-world settings, such discrete boundaries are unavailable, and may not even exist. To address these settings, we propose a Fully Online Meta-Learning (FOML) algorithm, which does not require any ground truth knowledge about the task boundaries and stays fully online without resetting back to pre-trained weights. Our experiments show that FOML was able to learn new tasks faster than the state-of-the-art online learning methods on Rainbow-MNIST, CIFAR100 and CELEBA datasets.

updated: Mon Feb 14 2022 06:53:32 GMT+0000 (UTC)

published: Tue Feb 01 2022 07:51:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト