Efficient Model Adaptation for Continual Learning at the Edge

Zachary A. Daniels; Jun Hu; Michael Lomnitz; Phil Miller; Aswin Raghavan; Joe Zhang; Michael Piacentino; David Zhang

Most machine learning (ML) systems assume stationary and matching data distributions during training and deployment. This is often a false assumption. When ML models are deployed on real devices, data distributions often shift over time due to changes in environmental factors, sensor characteristics, and the task of interest. While it is possible to keep a human in the loop to monitor for distribution shifts and engineer new architectures in response to these shifts, such a setup is not cost-effective. Instead, non-stationary automated ML (AutoML) models are needed. This paper presents the Encoder-Adaptor-Reconfigurator (EAR) framework for efficient continual learning under domain shifts. The EAR framework uses a fixed deep neural network (DNN) feature encoder and trains shallow networks on top of the encoder to handle novel data. The EAR framework is capable of 1) detecting when new data is out-of-distribution (OOD) by combining DNNs with hyperdimensional computing (HDC), 2) identifying low-parameter neural adaptors to adapt the model to the OOD data using zero-shot neural architecture search (ZS-NAS), and 3) minimizing catastrophic forgetting on previous tasks by progressively growing the neural architecture as needed and dynamically routing data through the appropriate adaptors and reconfigurators for handling domain-incremental and class-incremental continual learning. We systematically evaluate our approach on several benchmark datasets for domain adaptation and demonstrate strong performance compared to state-of-the-art algorithms for OOD detection and few-/zero-shot NAS.
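
To make the first capability more concrete, the following is a minimal sketch of how features from a frozen encoder might be combined with hyperdimensional computing to flag out-of-distribution inputs. It is not the authors' implementation: the random sign projection, the per-class bundling, the cosine-based score, the quantile threshold, and all names and sizes (`FEATURE_DIM`, `HYPER_DIM`, `to_hypervector`, `build_prototypes`, `ood_score`) are assumptions chosen for illustration, and synthetic Gaussian vectors stand in for real encoder outputs.

```python
import numpy as np

rng = np.random.default_rng(0)

FEATURE_DIM = 64    # assumed size of the frozen encoder's output
HYPER_DIM = 4096    # assumed hypervector dimensionality

# Fixed random projection standing in for an HDC encoding step (assumption).
PROJECTION = rng.standard_normal((FEATURE_DIM, HYPER_DIM))


def to_hypervector(features):
    """Map feature rows to bipolar (+1/-1) hypervectors via a sign projection."""
    return np.where(features @ PROJECTION >= 0.0, 1.0, -1.0)


def build_prototypes(features, labels):
    """Bundle (sum) the hypervectors of each class into one prototype per class."""
    hv = to_hypervector(features)
    return np.stack([hv[labels == c].sum(axis=0) for c in np.unique(labels)])


def ood_score(features, prototypes):
    """Return 1 - max cosine similarity to any prototype (higher = more novel)."""
    hv = to_hypervector(features)
    hv_unit = hv / np.linalg.norm(hv, axis=1, keepdims=True)
    proto_unit = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return 1.0 - (hv_unit @ proto_unit.T).max(axis=1)


def sample(mean, n):
    """Draw n synthetic 'encoder features' around a given mean."""
    return mean + rng.standard_normal((n, FEATURE_DIM))


# Synthetic stand-ins: two known classes and one shifted, unseen domain.
class_means = 3.0 * rng.standard_normal((2, FEATURE_DIM))
shifted_mean = 3.0 * rng.standard_normal(FEATURE_DIM)

train_x = np.concatenate([sample(class_means[0], 100), sample(class_means[1], 100)])
train_y = np.repeat([0, 1], 100)
prototypes = build_prototypes(train_x, train_y)

# Held-out in-distribution data sets the threshold; shifted data should exceed it.
id_scores = ood_score(
    np.concatenate([sample(class_means[0], 50), sample(class_means[1], 50)]),
    prototypes,
)
ood_scores = ood_score(sample(shifted_mean, 100), prototypes)
threshold = np.quantile(id_scores, 0.95)

print("mean in-distribution score:", id_scores.mean())
print("mean shifted-domain score:", ood_scores.mean())
print("fraction of shifted data flagged:", (ood_scores > threshold).mean())
```

In a pipeline of the kind the abstract describes, inputs whose score exceeds the threshold would be the ones that trigger the search for a new adaptor, while the remaining inputs would be routed through the existing ones. How the paper actually computes its OOD score and sets its threshold is not stated in the abstract.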

updated: Fri Oct 13 2023 19:02:33 GMT+0000 (UTC)

published: Thu Aug 03 2023 23:55:17 GMT+0000 (UTC)

arXiv
