IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay

Jian Jiang; Edoardo Cetin; Oya Celiktutan

IB-DRR：情報を使用したインクリメンタル学習-バックディスクリート表現のリプレイ

インクリメンタル学習は、機械学習モデルが、古いクラスですでに学習した知識を維持しながら、新しいクラスで新しい知識を継続的に取得できるようにすることを目的としています。以前に見たクラスのトレーニングサンプルのサブセットをメモリに保存し、新しいトレーニングフェーズ中にそれらを再生することは、この目的を達成するための効率的かつ効果的な方法であることが証明されています。モデルが継承するエグザンプラの数が多いほど、達成できるパフォーマンスが向上することは明らかです。ただし、モデルのパフォーマンスとクラスごとに保存するサンプル数との間のトレードオフを見つけることは、リプレイベースの増分学習にとって未解決の問題であり、実際のアプリケーションにとってますます望まれています。このホワイトペーパーでは、2段階の圧縮アプローチを利用して、この未解決の問題にアプローチします。最初のステップは不可逆圧縮です。入力画像をエンコードし、階層的なベクトル量子化変分オートエンコーダー（VQ-VAE）を使用して学習したコードの形式でそれらの離散潜在表現を保存することを提案します。 2番目のステップでは、ビットバック非対称記数法（BB-ANS）を使用して階層潜在変数モデルを学習することにより、コードをロスレスでさらに圧縮します。最初のステップの圧縮で失われた情報を補うために、対照的な学習損失の実際のエグザンプラを利用して分類器のトレーニングを正規化するInformation Back（IB）メカニズムを導入します。見られるすべてのエグザンプラの表現を「コード」の形式で維持することにより、Discrete Representation Replay（DRR）は、サンプルの保存に必要なメモリコストを大幅に削減しながら、CIFAR-100の最先端の方法よりも4％の精度で優れています。。 IBを組み込んで、古い生のエグザンプラの小さなセットも保存することで、DRRの精度を2％の精度でさらに向上させることができます。

Incremental learning aims to enable machine learning models to continuously acquire new knowledge given new classes, while maintaining the knowledge already learned for old classes. Saving a subset of training samples of previously seen classes in the memory and replaying them during new training phases is proven to be an efficient and effective way to fulfil this aim. It is evident that the larger number of exemplars the model inherits the better performance it can achieve. However, finding a trade-off between the model performance and the number of samples to save for each class is still an open problem for replay-based incremental learning and is increasingly desirable for real-life applications. In this paper, we approach this open problem by tapping into a two-step compression approach. The first step is a lossy compression, we propose to encode input images and save their discrete latent representations in the form of codes that are learned using a hierarchical Vector Quantised Variational Autoencoder (VQ-VAE). In the second step, we further compress codes losslessly by learning a hierarchical latent variable model with bits-back asymmetric numeral systems (BB-ANS). To compensate for the information lost in the first step compression, we introduce an Information Back (IB) mechanism that utilizes real exemplars for a contrastive learning loss to regularize the training of a classifier. By maintaining all seen exemplars' representations in the format of `codes', Discrete Representation Replay (DRR) outperforms the state-of-art method on CIFAR-100 by a margin of 4% accuracy with a much less memory cost required for saving samples. Incorporated with IB and saving a small set of old raw exemplars as well, the accuracy of DRR can be further improved by 2% accuracy.

updated: Wed Apr 21 2021 15:32:11 GMT+0000 (UTC)

published: Wed Apr 21 2021 15:32:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト