BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split Computing

Yoshitomo Matsubara; Davide Callegaro; Sameer Singh; Marco Levorato; Francesco Restuccia

BottleFit：効果的かつ効率的なスプリットコンピューティングのためのディープニューラルネットワークでの圧縮表現の学習

ミッションクリティカルなアプリケーションではディープニューラルネットワーク（DNN）を使用する必要がありますが、モバイルデバイスで継続的に実行すると、エネルギー消費量が大幅に増加します。エッジオフロードはエネルギー消費を減らすことができますが、チャネル品質、ネットワーク、およびエッジサーバーの負荷の不規則なパターンは、システムの主要な操作の深刻な混乱につながる可能性があります。スプリットコンピューティングと呼ばれる別のアプローチでは、モデル内に圧縮された表現（「ボトルネック」と呼ばれる）を生成して、帯域幅の使用量とエネルギー消費量を削減します。以前の研究では、エネルギー消費と待ち時間を犠牲にして、追加のレイヤーを導入するアプローチが提案されています。このため、BottleFitと呼ばれる新しいフレームワークを提案します。これには、対象となるDNNアーキテクチャの変更に加えて、強力な圧縮率でも高精度を達成するための新しいトレーニング戦略が含まれています。画像分類の最先端のDNNモデルにBottleFitを適用し、BottleFitがImageNetデータセットで最大0.6％の精度低下で77.1％のデータ圧縮を達成する一方で、SPINNなどの最先端の精度では最大6％低下することを示します。 NVIDIA Jetson Nanoボード（GPUベース）とRaspberry PIボード（GPUレス）で実行されている画像分類アプリケーションの消費電力と遅延を実験的に測定します。 BottleFitは、（wrt）ローカルコンピューティングに関してそれぞれ最大49％と89％、エッジオフロードに対して37％と55％の電力消費とレイテンシーを削減することを示しています。また、BottleFitを最先端のオートエンコーダベースのアプローチと比較し、（i）BottleFitが消費電力と実行時間をそれぞれJetsonで最大54％と44％、Raspberryで最大40％と62％削減することを示しますPI; （ii）モバイルデバイスで実行されるヘッドモデルのサイズは83分の1です。結果の完全な再現性のために、コードリポジトリが公開されます。

Although mission-critical applications require the use of deep neural networks (DNNs), their continuous execution at mobile devices results in a significant increase in energy consumption. While edge offloading can decrease energy consumption, erratic patterns in channel quality, network and edge server load can lead to severe disruption of the system's key operations. An alternative approach, called split computing, generates compressed representations within the model (called "bottlenecks"), to reduce bandwidth usage and energy consumption. Prior work has proposed approaches that introduce additional layers, to the detriment of energy consumption and latency. For this reason, we propose a new framework called BottleFit, which, in addition to targeted DNN architecture modifications, includes a novel training strategy to achieve high accuracy even with strong compression rates. We apply BottleFit on cutting-edge DNN models in image classification, and show that BottleFit achieves 77.1% data compression with up to 0.6% accuracy loss on ImageNet dataset, while state of the art such as SPINN loses up to 6% in accuracy. We experimentally measure the power consumption and latency of an image classification application running on an NVIDIA Jetson Nano board (GPU-based) and a Raspberry PI board (GPU-less). We show that BottleFit decreases power consumption and latency respectively by up to 49% and 89% with respect to (w.r.t.) local computing and by 37% and 55% w.r.t. edge offloading. We also compare BottleFit with state-of-the-art autoencoders-based approaches, and show that (i) BottleFit reduces power consumption and execution time respectively by up to 54% and 44% on the Jetson and 40% and 62% on Raspberry PI; (ii) the size of the head model executed on the mobile device is 83 times smaller. The code repository will be published for full reproducibility of the results.

updated: Fri Jan 07 2022 22:08:07 GMT+0000 (UTC)

published: Fri Jan 07 2022 22:08:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト