Denoised Internal Models: a Brain-Inspired Autoencoder against Adversarial Attacks

Kaiyuan Liu; Xingyu Li; Yi Zhou; Jisong Guan; Yurui Lai; Ge Zhang; Hang Su; Jiachen Wang; Chunxu Guo

Despite its great success, deep learning suffers from a severe lack of robustness: deep neural networks are highly vulnerable to adversarial attacks, even the simplest ones. Inspired by recent advances in brain science, we propose Denoised Internal Models (DIM), a novel generative autoencoder-based model that tackles this challenge. DIM simulates the human brain's pipeline for visual signal processing and adopts a two-stage approach. In the first stage, DIM uses a denoiser to reduce the noise and the dimensionality of the inputs, reflecting the information pre-processing in the thalamus. The second stage, inspired by the sparse coding of memory-related traces in the primary visual cortex, produces a set of internal models, one for each category. We evaluate DIM against 42 adversarial attacks and show that it defends effectively against all of them and outperforms the state of the art (SOTA) in overall robustness.
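The two-stage structure described above can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the denoiser, the internal-model architecture, or the decision rule, so everything below is an assumption made for illustration. Average pooling stands in for the denoiser, a per-category mean vector stands in for each internal model, and the prediction rule (choose the category whose model lies closest to the denoised input) is hypothetical. The names `denoise`, `fit_internal_models`, and `predict` are likewise invented here.

```python
import random
from statistics import fmean


def denoise(x, pool=2):
    """Stage 1 stand-in (assumption): average-pool neighbouring values,
    which reduces both the noise and the dimension of the input."""
    return [fmean(x[i:i + pool]) for i in range(0, len(x), pool)]


def fit_internal_models(samples, labels):
    """Stage 2 stand-in (assumption): one model per category, here simply
    the mean of that category's denoised training samples."""
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(denoise(x))
    return {y: [fmean(col) for col in zip(*zs)] for y, zs in by_class.items()}


def predict(models, x):
    """Hypothetical decision rule: return the category whose internal model
    has the smallest squared distance to the denoised input."""
    z = denoise(x)

    def error(y):
        return sum((a - b) ** 2 for a, b in zip(z, models[y]))

    return min(models, key=error)


# Toy data: two categories with distinct 8-dimensional templates plus noise.
rng = random.Random(0)
templates = {0: [0.0] * 4 + [1.0] * 4, 1: [1.0] * 4 + [0.0] * 4}


def sample(y, noise=0.3):
    return [v + rng.gauss(0, noise) for v in templates[y]]


train_y = [0, 1] * 20
train_x = [sample(y) for y in train_y]
models = fit_internal_models(train_x, train_y)

test_y = [0, 1] * 10
accuracy = sum(predict(models, sample(y)) == y for y in test_y) / len(test_y)
print(f"toy accuracy: {accuracy:.2f}")
```

In a model closer to the one the abstract describes, each stand-in would presumably be replaced by a learned network (a trained denoiser and one generative autoencoder per category); the sketch only shows how the two stages compose.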

updated: Sun Nov 21 2021 15:46:30 GMT+0000 (UTC)

published: Sun Nov 21 2021 15:46:30 GMT+0000 (UTC)

arXiv
