Detect and Defense Against Adversarial Examples in Deep Learning using Natural Scene Statistics and Adaptive Denoising

Anouar Kherchouche; Sid Ahmed Fezza; Wassim Hamidouche

自然シーン統計と適応型ノイズ除去を使用したディープラーニングの敵対的な例に対する検出と防御

ディープニューラルネットワーク（DNN）の膨大なパフォーマンスにもかかわらず、最近の研究では、敵対的な例（AE）、つまり、ターゲットを絞ったDNNをだますように設計された慎重に摂動された入力に対する脆弱性が示されています。現在、そのようなAEを作成するための多くの効果的な攻撃が文献に豊富に含まれています。一方、この脆弱性を軽減するために、多くの防御戦略が開発されてきました。ただし、後者は特定の攻撃に対して有効性を示しており、一般化されていません。さまざまな攻撃にうまく対応できません。この論文では、敵対的なサンプルからDNN分類器を防御するためのフレームワークを提案します。提案された方法は、別個の検出器とノイズ除去ブロックを含む2段階のフレームワークに基づいています。検出器は、自然シーン統計（NSS）を使用してAEを特徴付けることにより、AEを検出することを目的としています。ここでは、これらの統計的特徴が敵対的摂動の存在によって変化することを示しています。デノイザーは、畳み込みニューラルネットワーク（CNN）によって推定された最適なしきい値によって供給されるブロックマッチング3D（BM3D）フィルターに基づいており、AEとして検出されたサンプルをデータ多様体に投影します。 MNIST、CIFAR-10、Tiny-ImageNetの3つの標準データセットで完全な評価を実施しました。実験結果は、提案された防御方法が、ブラックボックス、グレーボックス、およびホワイトボックスの設定の下での一連の攻撃に対する堅牢性を改善することにより、最先端の防御技術よりも優れていることを示しています。ソースコードはhttps://github.com/kherchouche-anouar/2DAEで入手できます。

Despite the enormous performance of deepneural networks (DNNs), recent studies have shown theirvulnerability to adversarial examples (AEs), i.e., care-fully perturbed inputs designed to fool the targetedDNN. Currently, the literature is rich with many ef-fective attacks to craft such AEs. Meanwhile, many de-fenses strategies have been developed to mitigate thisvulnerability. However, these latter showed their effec-tiveness against specific attacks and does not general-ize well to different attacks. In this paper, we proposea framework for defending DNN classifier against ad-versarial samples. The proposed method is based on atwo-stage framework involving a separate detector anda denoising block. The detector aims to detect AEs bycharacterizing them through the use of natural scenestatistic (NSS), where we demonstrate that these statis-tical features are altered by the presence of adversarialperturbations. The denoiser is based on block matching3D (BM3D) filter fed by an optimum threshold valueestimated by a convolutional neural network (CNN) toproject back the samples detected as AEs into theirdata manifold. We conducted a complete evaluation onthree standard datasets namely MNIST, CIFAR-10 andTiny-ImageNet. The experimental results show that theproposed defense method outperforms the state-of-the-art defense techniques by improving the robustnessagainst a set of attacks under black-box, gray-box and white-box settings. The source code is available at: https://github.com/kherchouche-anouar/2DAE

updated: Mon Jul 12 2021 23:45:44 GMT+0000 (UTC)

published: Mon Jul 12 2021 23:45:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト