Continual learning under domain transfer with sparse synaptic bursting

Shawn L. Beaulieu; Jeff Clune; Nick Cheney

スパースシナプスバーストを伴うドメイン転送の下での継続的な学習

既存のマシンは、予測と制御を容易にするために作成された機能固有のツールです。明日のマシンは、その可変性、回復力、および自律性において、生物学的システムに近い可能性があります。しかし、最初に、彼らは、恣意的に頻繁に公開されることなく、新しい情報を順番に学習し、保持することができなければなりません。このようなシステムを設計するための過去の取り組みは、特定のタスクまたは入力に一意に敏感な互いに素な重みのセットを使用して、人工ニューラルネットワークを構築または調整しようと努めてきました。これは、既存の知識を損なうことなく、以前は見られなかったデータの長いシーケンスを継続的に学習することをまだ可能にしていません。これは、壊滅的な忘却として知られる問題です。この論文では、これまでに見られなかったデータセット（ImageNet、CIFAR-100）を、時間の経過とともにほとんど忘れることなく順次学習できるシステムを紹介します。これは、2番目のフィードフォワードニューラルネットワークによって生成されたトップダウンレギュレーションを使用して、入力に基づいて畳み込みニューラルネットワークの重みのアクティビティを制御することによって行われます。私たちの方法は、タスク固有のモジュールを維持するのではなく、タスク間でリサイクルされる重みのアクティビティのまばらなバーストを使用して、ドメイン転送の下で継続的に学習することがわかります。スパースシナプスバーストは、現存する知識を損なうことなく新しい機能を学習できるように、活動と抑制のバランスを取り、カオスの端にあるシステムの秩序と無秩序のバランスを反映していることがわかります。この動作は、事前トレーニング（または「メタ学習」）フェーズで発生します。このフェーズでは、調整されたシナプスが、予測エラーの最小化による均一な抑制の初期状態から選択的に抑制解除または成長します。

Existing machines are functionally specific tools that were made for easy prediction and control. Tomorrow's machines may be closer to biological systems in their mutability, resilience, and autonomy. But first they must be capable of sequentially learning, and retaining, new information without being exposed to it arbitrarily often. Past efforts to engineer such systems have sought to build or regulate artificial neural networks using disjoint sets of weights that are uniquely sensitive to specific tasks or inputs. This has not yet enabled continual learning over long sequences of previously unseen data without corrupting existing knowledge: a problem known as catastrophic forgetting. In this paper, we introduce a system that can learn sequentially over previously unseen datasets (ImageNet, CIFAR-100) with little forgetting over time. This is done by controlling the activity of weights in a convolutional neural network on the basis of inputs using top-down regulation generated by a second feed-forward neural network. We find that our method learns continually under domain transfer with sparse bursts of activity in weights that are recycled across tasks, rather than by maintaining task-specific modules. Sparse synaptic bursting is found to balance activity and suppression such that new functions can be learned without corrupting extant knowledge, thus mirroring the balance of order and disorder in systems at the edge of chaos. This behavior emerges during a prior pre-training (or 'meta-learning') phase in which regulated synapses are selectively disinhibited, or grown, from an initial state of uniform suppression through prediction error minimization.

updated: Sun May 08 2022 16:48:56 GMT+0000 (UTC)

published: Thu Aug 26 2021 22:53:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト