Improving Variational Autoencoder based Out-of-Distribution Detection for Embedded Real-time Applications

Yeli Feng; Daniel Jun Xian Ng; Arvind Easwaran

組み込みリアルタイムアプリケーションのための変分オートエンコーダベースの分布外検出の改善

機械学習の不確実性は、セーフティクリティカルなサイバーフィジカルシステム（CPS）に適用するための重要な障害です。不確実性の原因の1つは、トレーニングシナリオとテストシナリオの間の入力データの分布の変化から生じます。このような分布の変化をリアルタイムで検出することは、この課題に対処するための新たなアプローチです。イメージングを含むCPSアプリケーションの高次元入力スペースは、タスクにさらに困難を追加します。ジェネレーティブ学習モデルは、タスク、つまり分布外（OoD）検出に広く採用されています。最先端技術を向上させるために、機械学習とCPSの両方の分野からの既存の提案を調査しました。後者では、自動運転エージェントのリアルタイムでの安全監視に焦点が当てられています。ビデオの動きの時空間相関を利用して、自動運転エージェントの周りの危険な動きを確実に検出できます。変分オートエンコーダ（VAE）の理論と実践における最新の進歩に触発されて、OoD検出の堅牢性をさらに高めるために、データの事前知識を活用しました。 nuScenesとSynthiaのデータセットに関する比較研究は、私たちの方法が運転シナリオに固有のOoD要因の検出能力を大幅に改善し、最先端のアプローチよりも42％優れていることを示しています。私たちのモデルはまた、ほぼ完全に一般化されており、実世界および実験されたシミュレーション駆動データセット全体で最先端のものより97％優れています。最後に、提案された1つの方法をツインエンコーダモデルにカスタマイズしました。このモデルは、リソースが限られた組み込みデバイスに展開して、リアルタイムのOoD検出を行うことができます。その実行時間は、低精度の8ビット整数推論で4倍以上短縮されましたが、検出機能は対応する浮動小数点モデルに匹敵します。

Uncertainties in machine learning are a significant roadblock for its application in safety-critical cyber-physical systems (CPS). One source of uncertainty arises from distribution shifts in the input data between training and test scenarios. Detecting such distribution shifts in real-time is an emerging approach to address the challenge. The high dimensional input space in CPS applications involving imaging adds extra difficulty to the task. Generative learning models are widely adopted for the task, namely out-of-distribution (OoD) detection. To improve the state-of-the-art, we studied existing proposals from both machine learning and CPS fields. In the latter, safety monitoring in real-time for autonomous driving agents has been a focus. Exploiting the spatiotemporal correlation of motion in videos, we can robustly detect hazardous motion around autonomous driving agents. Inspired by the latest advances in the Variational Autoencoder (VAE) theory and practice, we tapped into the prior knowledge in data to further boost OoD detection's robustness. Comparison studies over nuScenes and Synthia data sets show our methods significantly improve detection capabilities of OoD factors unique to driving scenarios, 42% better than state-of-the-art approaches. Our model also generalized near-perfectly, 97% better than the state-of-the-art across the real-world and simulation driving data sets experimented. Finally, we customized one proposed method into a twin-encoder model that can be deployed to resource limited embedded devices for real-time OoD detection. Its execution time was reduced over four times in low-precision 8-bit integer inference, while detection capability is comparable to its corresponding floating-point model.

updated: Sun Jul 25 2021 07:52:53 GMT+0000 (UTC)

published: Sun Jul 25 2021 07:52:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト