EXPANSE: A Deep Continual / Progressive Learning System for Deep Transfer Learning

Mohammadreza Iman; John A. Miller; Khaled Rasheed; Robert M. Branch; Hamid R. Arabnia

EXPANSE：ディープトランスファーラーニングのためのディープコンティニュアス/プログレッシブラーニングシステム

ディープラーニング学習手法は、取得した知識を再利用することにより、ディープラーニングの制限、広範なトレーニングデータへの依存、およびトレーニングコストに対処しようとします。ただし、現在のDTL手法は、事前にトレーニングされたモデルを微調整したり、事前にトレーニングされたモデルの一部を凍結したりする際に、壊滅的な忘却のジレンマ（以前に取得した知識を失う）または過度にバイアスされた事前トレーニング済みモデル（ターゲットデータに適応するのが難しい）のいずれかに悩まされます。、それぞれ。 DTLのサブカテゴリであるプログレッシブ学習は、凍結された事前トレーニング済みモデルの最後に新しい層を追加することにより、以前の層を凍結した場合の過度に偏ったモデルの影響を減らします。多くの場合成功していますが、遠方のソースデータとターゲットデータを処理することはできません。これらの制限に取り組むために、ディープトランスファー学習のための新しい継続的/進歩的な学習アプローチを提案します。壊滅的な忘却と過度に偏ったモデルの問題の両方を回避するために、新しいレイヤーを追加するだけでなく、モデル内の事前トレーニング済みレイヤーを拡張する（各レイヤーに新しいノードを追加する）ことによって、事前トレーニング済みモデルを拡張します。したがって、このメソッドの名前はEXPANSEです。私たちの実験結果は、この手法を使用して、離れたソースデータとターゲットデータに取り組むことができることを確認しています。同時に、最終モデルはソースデータで引き続き有効であり、有望な深い継続的な学習アプローチを実現します。さらに、人間の教育システムに触発された深層学習モデルをトレーニングする新しい方法を提供します。この2段階のトレーニングを、最初に基本を学び、次に複雑さと不確実性を追加することと呼びました。この評価は、2段階のトレーニングでは、通常のトレーニングと比較して精度が向上するため、エラーサーフェス上でより意味のある特徴とより細かい盆地が抽出されることを意味します。 EXPANSE（モデルの拡張と2段階のトレーニング）は、さまざまな問題やDLモデルに適用できる体系的な継続的な学習アプローチです。

Deep transfer learning techniques try to tackle the limitations of deep learning, the dependency on extensive training data and the training costs, by reusing obtained knowledge. However, the current DTL techniques suffer from either catastrophic forgetting dilemma (losing the previously obtained knowledge) or overly biased pre-trained models (harder to adapt to target data) in finetuning pre-trained models or freezing a part of the pre-trained model, respectively. Progressive learning, a sub-category of DTL, reduces the effect of the overly biased model in the case of freezing earlier layers by adding a new layer to the end of a frozen pre-trained model. Even though it has been successful in many cases, it cannot yet handle distant source and target data. We propose a new continual/progressive learning approach for deep transfer learning to tackle these limitations. To avoid both catastrophic forgetting and overly biased-model problems, we expand the pre-trained model by expanding pre-trained layers (adding new nodes to each layer) in the model instead of only adding new layers. Hence the method is named EXPANSE. Our experimental results confirm that we can tackle distant source and target data using this technique. At the same time, the final model is still valid on the source data, achieving a promising deep continual learning approach. Moreover, we offer a new way of training deep learning models inspired by the human education system. We termed this two-step training: learning basics first, then adding complexities and uncertainties. The evaluation implies that the two-step training extracts more meaningful features and a finer basin on the error surface since it can achieve better accuracy in comparison to regular training. EXPANSE (model expansion and two-step training) is a systematic continual learning approach applicable to different problems and DL models.

updated: Tue May 24 2022 01:13:36 GMT+0000 (UTC)

published: Thu May 19 2022 03:54:58 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト