EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference

Sumaiya Tabassum Nimi; Md Adnan Arefeen; Md Yusuf Sarwar Uddin; Yugyung Lee

Collaborative inference enables resource-constrained edge devices to make inferences by uploading inputs (e.g., images) to a server (i.e., the cloud) where the heavy deep learning models run. While this setup works cost-effectively for successful inferences, it severely underperforms when the model faces input samples on which it was not trained, known as Out-of-Distribution (OOD) samples. If an edge device could at least detect that an input sample is OOD, it could save communication and computation resources by not uploading that input to the server for inference. In this paper, we propose a novel lightweight OOD detection approach that mines important features from the shallow layers of a pretrained CNN model and classifies an input sample as ID (In-Distribution) or OOD based on a distance function defined on the reduced feature space. Our technique (a) works on pretrained models without any retraining, and (b) is not exposed to any OOD dataset: all detection parameters are obtained from the ID training dataset. To this end, we develop EARLIN (EARLy OOD detection for Collaborative INference), which takes a pretrained model, partitions it at the OOD detection layer, and deploys the considerably smaller OOD detection part on an edge device and the rest on the cloud. Through experiments with real datasets and a prototype implementation, we show that our technique achieves better results than other approaches in terms of overall accuracy and cost when tested against popular OOD datasets on popular deep learning models pretrained on benchmark datasets.
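The detection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how features are selected or which distance function is used, so the choices below are assumptions made only for illustration. They are global average pooling, keeping the channels with the largest mean ID activation, Euclidean distance to the ID centroid, and a threshold set at a quantile of the ID training distances. The names `EarlyOODDetector`, `pool_features`, `collaborative_infer`, and `cloud_fn` are hypothetical. Shallow-layer feature maps are assumed to be NumPy arrays of shape (N, C, H, W).

```python
import numpy as np


def pool_features(feature_maps):
    """Global-average-pool feature maps of shape (N, C, H, W) to (N, C)."""
    return np.asarray(feature_maps).mean(axis=(2, 3))


class EarlyOODDetector:
    """Distance-based OOD detector fitted on ID data only (illustrative sketch)."""

    def __init__(self, n_keep=8, id_quantile=0.95):
        self.n_keep = n_keep            # size of the reduced feature space (assumed)
        self.id_quantile = id_quantile  # fraction of ID samples to accept (assumed)

    def fit(self, id_feature_maps):
        pooled = pool_features(id_feature_maps)
        # Assumed selection rule: keep the channels with the largest mean ID activation.
        self.channels_ = np.argsort(pooled.mean(axis=0))[-self.n_keep:]
        reduced = pooled[:, self.channels_]
        self.centroid_ = reduced.mean(axis=0)
        # The threshold is derived from ID training distances; no OOD data is used.
        distances = np.linalg.norm(reduced - self.centroid_, axis=1)
        self.threshold_ = np.quantile(distances, self.id_quantile)
        return self

    def distance(self, feature_maps):
        reduced = pool_features(feature_maps)[:, self.channels_]
        return np.linalg.norm(reduced - self.centroid_, axis=1)

    def is_ood(self, feature_maps):
        return self.distance(feature_maps) > self.threshold_


def collaborative_infer(feature_maps, detector, cloud_fn):
    """Send only ID samples to cloud_fn; return None for samples flagged OOD."""
    flags = detector.is_ood(feature_maps)
    return [None if flag else cloud_fn(fm) for fm, flag in zip(feature_maps, flags)]


# Synthetic stand-ins for shallow-layer activations (not real model outputs).
rng = np.random.default_rng(0)
id_maps = rng.normal(loc=1.0, scale=0.1, size=(200, 16, 4, 4))
ood_maps = rng.normal(loc=3.0, scale=0.1, size=(20, 16, 4, 4))

detector = EarlyOODDetector().fit(id_maps)
results = collaborative_infer(ood_maps, detector, cloud_fn=lambda fm: "label")
```

In this sketch the edge device would run only the shallow layers plus `is_ood`, and `cloud_fn` stands in for the remaining layers of the partitioned model running on the server.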

updated: Fri Jun 25 2021 18:43:23 GMT+0000 (UTC)

published: Fri Jun 25 2021 18:43:23 GMT+0000 (UTC)

arXiv
