ZS-IL: Looking Back on Learned ExperiencesFor Zero-Shot Incremental Learning

Mozhgan PourKeshavarz; Mohammad Sabokrou

ZS-IL：ゼロショットインクリメンタル学習のために学習した経験を振り返る

従来のディープニューラルネットワークは、トレーニングデータの新しいストリームから学習する能力に制限があります。新しいタスクや進化するタスクについて順次トレーニングすると、パフォーマンスが急激に低下し、実際のユースケースでは不適切になります。既存の方法は、古いデータサンプルを保存するか、DNNのパラメータセットのみを更新することでこれに取り組みますが、大きなメモリバジェットが必要になるか、増分されたクラス分布を学習するモデルの柔軟性が損なわれます。このホワイトペーパーでは、データストリームに新しいクラスが発生するたびに過去の経験を提供するために、オンコール転送セットに光を当てます。特に、モデルが学習した過去の経験を再現するだけでなく、これをゼロショット方式で実行するために、ゼロショットインクリメンタル学習を提案します。この目的に向けて、新しいタスク（クラス）が出現するたびに過去のエグザンプラを合成するためにネットワークにクエリを実行するメモリ回復パラダイムを導入しました。したがって、私たちの方法は、前のクラスを壊滅的に忘れることを軽減するために、転送セットと呼ばれる過去のエグザンプラを提供するために提案されたメモリ回復パラダイムを呼び出す以外に、固定サイズのメモリを必要としません。さらに、最近提案された方法とは対照的に、提案されたパラダイムは、学習者ネットワークのみに依存しているため、並列アーキテクチャを望んでいません。過去のデータサンプルをバッファリングしない最先端のデータ手法と比較して、ZS-ILは、タスクILとクラスILの両方の設定で、既知のデータセット（CIFAR-10、Tiny-ImageNet）で大幅に優れたパフォーマンスを示します。。

Classical deep neural networks are limited in their ability to learn from emerging streams of training data. When trained sequentially on new or evolving tasks, their performance degrades sharply, making them inappropriate in real-world use cases. Existing methods tackle it by either storing old data samples or only updating a parameter set of DNNs, which, however, demands a large memory budget or spoils the flexibility of models to learn the incremented class distribution. In this paper, we shed light on an on-call transfer set to provide past experiences whenever a new class arises in the data stream. In particular, we propose a Zero-Shot Incremental Learning not only to replay past experiences the model has learned but also to perform this in a zero-shot manner. Towards this end, we introduced a memory recovery paradigm in which we query the network to synthesize past exemplars whenever a new task (class) emerges. Thus, our method needs no fixed-sized memory, besides calls the proposed memory recovery paradigm to provide past exemplars, named a transfer set in order to mitigate catastrophically forgetting the former classes. Moreover, in contrast with recently proposed methods, the suggested paradigm does not desire a parallel architecture since it only relies on the learner network. Compared to the state-of-the-art data techniques without buffering past data samples, ZS-IL demonstrates significantly better performance on the well-known datasets (CIFAR-10, Tiny-ImageNet) in both Task-IL and Class-IL settings.

updated: Mon Mar 22 2021 22:43:20 GMT+0000 (UTC)

published: Mon Mar 22 2021 22:43:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト