MOCCA: Multi-Layer One-Class ClassificAtion for Anomaly Detection

Fabio Valerio Massoli; Fabrizio Falchi; Alperen Kantarci; Şeymanur Akti; Hazim Kemal Ekenel; Giuseppe Amato

MOCCA：異常検出のための多層1クラス分類

異常はすべての科学分野に遍在しており、データ分布に関する知識が不完全であるか、突然作用して観測を歪める未知のプロセスが原因で、予期しないイベントを表す可能性があります。このようなイベントはまれであるため、異常検出（AD）タスクで深層学習モデルをトレーニングするために、科学者は「通常の」データ、つまり異常でないサンプルのみに依存します。したがって、ニューラルネットワークに入力データの下の分布を推測させます。このような状況で、ADタスクの深層学習モデルをトレーニングおよびテストするためのMulti-layer One-Class ClassificAtion（MOCCA）という名前の新しいフレームワークを提案します。具体的には、オートエンコーダーに適用しました。私たちの仕事の重要な目新しさは、ADタスクの中間表現の明示的な最適化に由来します。実際、ニューラルネットワークを単一の計算ブロックと見なす一般的に使用されるアプローチとは異なり、つまり、最後の層の出力のみを使用する場合、MOCCAは深いアーキテクチャの多層構造を明示的に活用します。各レイヤーの特徴空間はトレーニング中にAD用に最適化され、テストフェーズでは、トレーニングされたレイヤーから抽出された深い表現が組み合わされて異常が検出されます。 MOCCAでは、トレーニングプロセスを2つのステップに分割しました。まず、オートエンコーダは再構築タスクについてのみトレーニングされます。次に、考慮される各レイヤーで、出力表現と参照ポイント（異常のないトレーニングデータ重心）との間のL_2距離を最小化するタスクを持つエンコーダーのみを保持します。続いて、エンコーダモデルのさまざまなトレーニング済みレイヤーで抽出された深い特徴を組み合わせて、推論時に異常を検出します。 MOCCAでトレーニングされたモデルのパフォーマンスを評価するために、公開されているデータセットに対して広範な実験を行います。我々は、提案された方法が、文献で利用可能な最先端のアプローチと同等またはそれ以上の性能に達することを示しています。

Anomalies are ubiquitous in all scientific fields and can express an unexpected event due to incomplete knowledge about the data distribution or an unknown process that suddenly comes into play and distorts observations. Due to such events' rarity, to train deep learning models on the Anomaly Detection (AD) task, scientists only rely on "normal" data, i.e., non-anomalous samples. Thus, letting the neural network infer the distribution beneath the input data. In such a context, we propose a novel framework, named Multi-layer One-Class ClassificAtion (MOCCA),to train and test deep learning models on the AD task. Specifically, we applied it to autoencoders. A key novelty in our work stems from the explicit optimization of intermediate representations for the AD task. Indeed, differently from commonly used approaches that consider a neural network as a single computational block, i.e., using the output of the last layer only, MOCCA explicitly leverages the multi-layer structure of deep architectures. Each layer's feature space is optimized for AD during training, while in the test phase, the deep representations extracted from the trained layers are combined to detect anomalies. With MOCCA, we split the training process into two steps. First, the autoencoder is trained on the reconstruction task only. Then, we only retain the encoder tasked with minimizing the L_2 distance between the output representation and a reference point, the anomaly-free training data centroid, at each considered layer. Subsequently, we combine the deep features extracted at the various trained layers of the encoder model to detect anomalies at inference time. To assess the performance of the models trained with MOCCA, we conduct extensive experiments on publicly available datasets. We show that our proposed method reaches comparable or superior performance to state-of-the-art approaches available in the literature.

updated: Sat Nov 27 2021 09:38:49 GMT+0000 (UTC)

published: Wed Dec 09 2020 08:32:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト