Hierarchical Training of Deep Neural Networks Using Early Exiting

Yamin Sepehri; Pedram Pad; Ahmet Caner Yüzügüler; Pascal Frossard; L. Andrea Dunbar

早期終了を使用した深層ニューラルネットワークの階層トレーニング

ディープニューラルネットワークは、ビジョンタスクに最先端の精度を提供しますが、トレーニングにはかなりのリソースが必要です。したがって、それらは、データを取得するエッジデバイスから離れたクラウドサーバーでトレーニングされます。この問題により、通信コスト、実行時間、およびプライバシーに関する懸念が高まります。この研究では、早期終了を使用してエッジワーカーとクラウドワーカーの間でアーキテクチャを分割することにより、通信コスト、トレーニングランタイム、およびプライバシーの問題を軽減する、ディープニューラルネットワークの新しい階層トレーニング方法が提案されています。この方法は、トレーニングフェーズ中にエッジとクラウドの間でニューラルネットワークのバックワードパスを分離するための早期終了のまったく新しいユースケースを提案します。トレーニングフェーズのシーケンシャルな性質のために、階層のレベルを同時にトレーニングできない、またはプライバシーを犠牲にして行う、ほとんどの利用可能な階層トレーニング方法の問題に対処します。これらのスキームとは対照的に、私たちの方法はエッジワーカーとクラウドワーカーの両方を同時に使用でき、生の入力データをクラウドと共有せず、バックワードパス中の通信を必要としません。この方法の有効性を実証するために、さまざまなニューラルネットワークアーキテクチャのいくつかのシミュレーションとデバイス上での実験が行われます。クラウドとの通信が 3G プロトコルで行われる場合、VGG-16 と ResNet-18 の CIFAR-10 分類実験で、この方法により実行時間が 29% と 61% 短縮されることが示されています。このランタイムの向上は達成されますが、精度の低下は無視できます。この方法は、エッジクラウドシステムの一部として、携帯電話やロボットなどの低リソースデバイスで高精度のディープニューラルネットワークのオンライン学習を提供するためのインスピレーションを与える可能性があり、新しいタスクやデータのクラスに直面する際の柔軟性を高めます。未来。

Deep Neural Networks provide state-of-the-art accuracy for vision tasks but they require significant resources for training. Thus, they are trained on cloud servers far from the edge devices that acquire the data. This issue increases communication cost, runtime and privacy concerns. In this study, a novel hierarchical training method for deep neural networks is proposed that reduces the communication cost, training runtime, and privacy concerns by dividing the architecture between edge and cloud workers using early exits. The method proposes a brand-new use case for early exits to separate the backward pass of neural networks between the edge and the cloud during the training phase. We address the issues of most available hierarchical training methods that due to the sequential nature of the training phase, cannot train the levels of hierarchy at the same time or they do it with the cost of privacy. In contrast to these schemes, our method can use both edge and cloud workers simultaneously, does not share the raw input data with the cloud, and does not require communication during the backward pass. Several simulations and on-device experiments for different neural network architectures are done to demonstrate the effectiveness of this method. It is shown that the method reduces 29% and 61% runtime in CIFAR-10 classification experiment for VGG-16 and ResNet-18 when the communication with the cloud is done over the 3G protocol. This gain in the runtime is achieved whilst the accuracy drop is negligible. This method can be inspirational to provide online learning of high-accuracy deep neural networks on low-resource devices such as mobile phones or robots as a part of an edge-cloud system, making them more flexible in facing new tasks and classes of data in the future.

updated: Sat Mar 04 2023 11:30:16 GMT+0000 (UTC)

published: Sat Mar 04 2023 11:30:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト