Anomaly Detection With Partitioning Overfitting Autoencoder Ensembles

Boris Lorbeer; Max Botler

オーバーフィットオートエンコーダアンサンブルの分割による異常検出

この論文では、教師なし外れ値検出（UOD）の新しい方法であるPOTATOES（Partitioning OverfiTting AuTOencoder EnSemble）を提案します。より正確には、UODのオートエンコーダーがあれば、この手法を使用して精度を向上させると同時に、正則化を調整する負担を取り除くことができます。正則化するのではなく、データを十分な数の同じサイズのパーツにランダムに分割し、各パーツに独自のオートエンコーダーを過剰適合させ、すべてのオートエンコーダー再構築エラーの最大値を異常スコアとして使用するという考え方です。モデルをさまざまな現実的なデータセットに適用し、インライアのセットが十分に密集している場合、この方法によって特定のオートエンコーダーのUODパフォーマンスが大幅に向上することを示します。再現性のために、コードはgithubで利用できるようになっているため、読者はこのペーパーで結果を再現したり、他のオートエンコーダーやデータセットにメソッドを適用したりできます。

In this paper, we propose POTATOES (Partitioning OverfiTting AuTOencoder EnSemble), a new method for unsupervised outlier detection (UOD). More precisely, given any autoencoder for UOD, this technique can be used to improve its accuracy while at the same time removing the burden of tuning its regularization. The idea is to not regularize at all, but to rather randomly partition the data into sufficiently many equally sized parts, overfit each part with its own autoencoder, and to use the maximum over all autoencoder reconstruction errors as the anomaly score. We apply our model to various realistic datasets and show that if the set of inliers is dense enough, our method indeed improves the UOD performance of a given autoencoder significantly. For reproducibility, the code is made available on github so the reader can recreate the results in this paper as well as apply the method to other autoencoders and datasets.

updated: Fri Jul 09 2021 04:40:04 GMT+0000 (UTC)

published: Sun Sep 06 2020 15:35:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト