Continual Learning with Pretrained Backbones by Tuning in the Input Space

Simone Marullo; Matteo Tiezzi; Marco Gori; Stefano Melacci; Tinne Tuytelaars

The intrinsic difficulty of adapting deep learning models to non-stationary environments limits the applicability of neural networks to real-world tasks. This issue is critical in practical supervised learning settings, such as those in which a pre-trained model computes projections onto a latent space where different task predictors are learned sequentially over time. Incrementally fine-tuning the whole model to better adapt to new tasks usually results in catastrophic forgetting: performance on past experiences degrades and valuable knowledge from the pre-training stage is lost. In this paper, we propose a novel strategy to make the fine-tuning procedure more effective. It avoids updating the pre-trained part of the network and learns not only the usual classification head, but also a set of newly introduced learnable parameters that are responsible for transforming the input data. This process allows the network to effectively leverage the pre-training knowledge and find a good trade-off between plasticity and stability with modest computational effort, making it especially suitable for on-the-edge settings. Our experiments on four image classification problems in a continual learning setting confirm the quality of the proposed approach when compared to several fine-tuning procedures and to popular continual learning methods.
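The general idea described in the abstract (frozen backbone, trainable input transformation, trainable classification head) can be illustrated with a toy sketch. This is not the authors' implementation. The abstract does not specify the form of the input transformation, so the element-wise affine map (`scale`, `shift`) used here is an assumption. The "backbone" is a stand-in fixed random projection, and the data is synthetic. All names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
D_IN, D_FEAT, N_CLASSES, N = 8, 16, 3, 64

# Stand-in for a pre-trained backbone (assumption): a fixed random projection.
# It is used in the forward and backward pass but is never updated.
W_backbone = rng.normal(size=(D_IN, D_FEAT)) / np.sqrt(D_IN)
W_backbone_initial = W_backbone.copy()

# Trainable parameters: an input-space transformation (assumed here to be an
# element-wise affine map) and a linear classification head.
params = {
    "scale": np.ones(D_IN),
    "shift": np.zeros(D_IN),
    "W_head": np.zeros((D_FEAT, N_CLASSES)),
    "b_head": np.zeros(N_CLASSES),
}

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def loss_and_grads(x, y, p):
    n = x.shape[0]
    x_t = x * p["scale"] + p["shift"]          # learnable input transformation
    h = np.tanh(x_t @ W_backbone)              # frozen backbone features
    probs = softmax(h @ p["W_head"] + p["b_head"])
    loss = -np.log(probs[np.arange(n), y] + 1e-12).mean()

    # Backpropagate the cross-entropy loss by hand.
    d_logits = probs.copy()
    d_logits[np.arange(n), y] -= 1.0
    d_logits /= n
    d_h = d_logits @ p["W_head"].T
    d_pre = d_h * (1.0 - h ** 2)               # derivative of tanh
    d_x_t = d_pre @ W_backbone.T               # gradient passes through the backbone
    grads = {
        "scale": (d_x_t * x).sum(axis=0),
        "shift": d_x_t.sum(axis=0),
        "W_head": h.T @ d_logits,
        "b_head": d_logits.sum(axis=0),
    }
    return loss, grads

# Synthetic task: labels come from a random linear rule on the raw inputs.
x = rng.normal(size=(N, D_IN))
y = np.argmax(x @ rng.normal(size=(D_IN, N_CLASSES)), axis=1)

initial_loss, _ = loss_and_grads(x, y, params)
for _ in range(200):
    _, grads = loss_and_grads(x, y, params)
    for name in params:                        # only input transform and head move
        params[name] -= 0.1 * grads[name]
final_loss, _ = loss_and_grads(x, y, params)
```

In this sketch the gradient flows through the backbone to reach the input-space parameters, but the backbone weights themselves never receive an update, so whatever the backbone encodes is left untouched.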

updated: Thu Jun 08 2023 07:43:36 GMT+0000 (UTC)

published: Mon Jun 05 2023 15:11:59 GMT+0000 (UTC)

arXiv
