Continual Learning in Neural Networks

Rahaf Aljundi

Artificial neural networks have exceeded human-level performance on several individual tasks (e.g. voice recognition, object recognition, and video games). However, such success remains modest compared to human intelligence, which can learn and perform an unlimited number of tasks. The human ability to learn and accumulate knowledge over a lifetime is an essential aspect of intelligence. Continual machine learning aims at a higher level of machine intelligence by providing artificial agents with the ability to learn online from a non-stationary and never-ending stream of data. A key component of such a never-ending learning process is overcoming the catastrophic forgetting of previously seen data, a problem that neural networks are well known to suffer from. The work described in this thesis is dedicated to the investigation of continual learning and of solutions that mitigate the forgetting phenomenon in neural networks. To approach the continual learning problem, we first assume a task-incremental setting where tasks are received one at a time and data from previous tasks are not stored. Since the task-incremental setting cannot be assumed in all continual learning scenarios, we also study the more general online continual setting. We consider an infinite stream of data drawn from a non-stationary distribution with a supervisory or self-supervisory training signal. The methods proposed in this thesis tackle important aspects of continual learning. They were evaluated on different benchmarks and over various learning sequences. Advances in the state of the art of continual learning are shown, and challenges for bringing continual learning into application are critically identified.

updated: Fri Oct 18 2019 09:48:14 GMT+0000 (UTC)

published: Mon Oct 07 2019 10:52:14 GMT+0000 (UTC)

arXiv
