Center Loss Regularization for Continual Learning

Kaustubh Olpadkar; Ekta Gavas

継続学習のためのセンターロス正則化

人工知能の開発には、さまざまなタスクを順番に学習する能力が不可欠です。一般に、ニューラルネットワークにはこの機能がなく、大きな障害は壊滅的な忘却です。これは、非定常データ分布から段階的に利用可能な情報が継続的に取得され、モデルがすでに学習したことを混乱させるときに発生します。私たちのアプローチは、決定境界を変更せずに、古いタスクの表現に近い新しいタスクの表現を投影することによって、古いタスクを記憶します。センターロスを正則化ペナルティとして採用し、新しいタスクの機能を古いタスクと同じクラスセンターにするように強制し、機能を非常に識別しやすくします。これにより、すでに学習した情報を忘れることが少なくなります。この方法は実装が簡単で、計算とメモリのオーバーヘッドを最小限に抑え、ニューラルネットワークが連続して発生する多くのタスクにわたって高いパフォーマンスを維持できるようにします。また、メモリリプレイと組み合わせてセンターロスを使用すると、他のリプレイベースの戦略よりも優れていることを示します。継続的な学習のための標準的なMNISTバリアントに加えて、DigitsおよびPACSデータセットを使用した継続的なドメイン適応シナリオにこの方法を適用します。私たちのアプローチはスケーラブルで効果的であり、最先端の継続的な学習方法と比較して競争力のあるパフォーマンスを提供することを示しています。

The ability to learn different tasks sequentially is essential to the development of artificial intelligence. In general, neural networks lack this capability, the major obstacle being catastrophic forgetting. It occurs when the incrementally available information from non-stationary data distributions is continually acquired, disrupting what the model has already learned. Our approach remembers old tasks by projecting the representations of new tasks close to that of old tasks while keeping the decision boundaries unchanged. We employ the center loss as a regularization penalty that enforces new tasks' features to have the same class centers as old tasks and makes the features highly discriminative. This, in turn, leads to the least forgetting of already learned information. This method is easy to implement, requires minimal computational and memory overhead, and allows the neural network to maintain high performance across many sequentially encountered tasks. We also demonstrate that using the center loss in conjunction with the memory replay outperforms other replay-based strategies. Along with standard MNIST variants for continual learning, we apply our method to continual domain adaptation scenarios with the Digits and PACS datasets. We demonstrate that our approach is scalable, effective, and gives competitive performance compared to state-of-the-art continual learning methods.

updated: Thu Oct 21 2021 17:46:44 GMT+0000 (UTC)

published: Thu Oct 21 2021 17:46:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト