Knowledge Capture and Replay for Continual Learning

Saisubramaniam Gopalakrishnan; Pranshu Ranjan Singh; Haytham Fayek; Savitha Ramasamy; Arulmurugan Ambikapathi

継続的な学習のための知識の獲得と再生

ディープニューラルネットワークはいくつかのドメインで有望であり、学習されたタスク固有の情報は暗黙的にネットワークパラメータに保存されます。継続的な学習などのダウンストリームタスクには、これらのネットワークからの表現を利用することが不可欠です。この論文では、ランダムな画像パターンの関数として、ネットワークのエンコードされた知識をキャプチャするための視覚的表現であるフラッシュカードの概念を紹介します。表現をキャプチャする際のフラッシュカードの有効性を示し、一般的およびタスクにとらわれない継続的な学習設定のための効率的な再生方法であることを示します。したがって、新しいタスクに適応する一方で、構築されたフラッシュカードの数が限られているため、以前に学習したタスクの壊滅的な忘却を防ぐのに役立ちます。最も興味深いことに、そのようなフラッシュカードは、外部メモリストレージを必要とせず、複数のタスクにわたって蓄積する必要もなく、以前にトレーニングされたタスクの数に関係なく、後続の新しいタスクを学習する直前に構築する必要があるため、タスクに依存しません。最初に、トレーニングされたネットワークから知識表現をキャプチャする際のフラッシュカードの有効性を示し、さまざまな継続的な学習タスク（継続的な教師なし再構築、継続的なノイズ除去、新しいインスタンスの学習分類）でのフラッシュカードの有効性を、多数の異種を使用して経験的に検証します。ベンチマークデータセット。これらの研究はまた、再生戦略としてフラッシュカードを使用した継続的な学習アルゴリズムが他の最先端の再生方法よりも優れており、コアセットサンプリングを使用して可能な限り最高のベースラインと同等のパフォーマンスを示し、追加の計算の複雑さとストレージが最小であることを示しています。

Deep neural networks have shown promise in several domains, and the learned task-specific information is implicitly stored in the network parameters. It will be vital to utilize representations from these networks for downstream tasks such as continual learning. In this paper, we introduce the notion of flashcards that are visual representations to capture the encoded knowledge of a network, as a function of random image patterns. We demonstrate the effectiveness of flashcards in capturing representations and show that they are efficient replay methods for general and task agnostic continual learning setting. Thus, while adapting to a new task, a limited number of constructed flashcards, help to prevent catastrophic forgetting of the previously learned tasks. Most interestingly, such flashcards neither require external memory storage nor need to be accumulated over multiple tasks and only need to be constructed just before learning the subsequent new task, irrespective of the number of tasks trained before and are hence task agnostic. We first demonstrate the efficacy of flashcards in capturing knowledge representation from a trained network, and empirically validate the efficacy of flashcards on a variety of continual learning tasks: continual unsupervised reconstruction, continual denoising, and new-instance learning classification, using a number of heterogeneous benchmark datasets. These studies also indicate that continual learning algorithms with flashcards as the replay strategy perform better than other state-of-the-art replay methods, and exhibits on par performance with the best possible baseline using coreset sampling, with the least additional computational complexity and storage.

updated: Sat Dec 12 2020 11:24:45 GMT+0000 (UTC)

published: Sat Dec 12 2020 11:24:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト