Hierarchical VAEs Know What They Don't Know

Jakob D. Havtorn; Jes Frellsen; Søren Hauberg; Lars Maaløe

Deep generative models have shown themselves to be state-of-the-art density estimators. Yet, recent work has found that they often assign a higher likelihood to data from outside the training distribution. This seemingly paradoxical behavior has caused concerns over the quality of the attained density estimates. In the context of hierarchical variational autoencoders, we provide evidence to explain this behavior by out-of-distribution data having in-distribution low-level features. We argue that this is both expected and desirable behavior. With this insight in hand, we develop a fast, scalable and fully unsupervised likelihood-ratio score for OOD detection that requires data to be in-distribution across all feature-levels. We benchmark the method on a vast set of data and model combinations and achieve state-of-the-art results on out-of-distribution detection.
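The abstract does not spell out the score, so the following is only a minimal sketch of how a generic likelihood-ratio OOD score could be computed and evaluated. It assumes two per-example log-likelihood estimates are already available (for instance, a model's usual bound and a second bound that relies more heavily on higher-level features). The function names, the sign convention (higher score means more likely OOD), and the toy numbers are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def likelihood_ratio_score(log_p_full, log_p_reference):
    """Per-example log-likelihood ratio: log p_full(x) - log p_reference(x).

    Assumption: both inputs are log-likelihood estimates (or bounds) in nats
    for the same examples, in the same order.
    """
    full = np.asarray(log_p_full, dtype=float)
    reference = np.asarray(log_p_reference, dtype=float)
    return full - reference


def auroc(scores_in, scores_out):
    """AUROC for separating OOD from in-distribution examples.

    Computed as the probability that a random OOD example receives a higher
    score than a random in-distribution example, counting ties as one half.
    """
    s_in = np.asarray(scores_in, dtype=float)
    s_out = np.asarray(scores_out, dtype=float)
    greater = (s_out[:, None] > s_in[None, :]).sum()
    ties = (s_out[:, None] == s_in[None, :]).sum()
    return (greater + 0.5 * ties) / (s_in.size * s_out.size)


# Toy, made-up log-likelihood values purely for illustration.
in_scores = likelihood_ratio_score([-100.0, -98.0], [-101.0, -100.0])
out_scores = likelihood_ratio_score([-90.0, -95.0], [-120.0, -110.0])
print(auroc(in_scores, out_scores))
```

Because AUROC depends only on the ranking of scores, no detection threshold has to be chosen for this kind of evaluation.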

updated: Mon Mar 01 2021 09:35:30 GMT+0000 (UTC)

published: Tue Feb 16 2021 16:08:04 GMT+0000 (UTC)

arXiv
