Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection

Hui Liu; Bo Zhao; Yuefeng Peng; Weidong Li; Peng Liu

敵対的検出における画像変換の効果の理解と活用に向けて

ディープニューラルネットワーク（DNN）は、敵対的な例によって脅かされています。敵対的な画像と良性の画像を区別する敵対的な検出は、堅牢なDNNベースのサービスの基本です。画像変換は、敵対的な例を検出するための最も効果的なアプローチの1つです。過去数年の間に、信頼できる敵対的検出器を設計するために、さまざまな画像変換が研究され、議論されてきました。本論文では、新しい分類法を用いた画像変換による敵対的検出の最近の進歩を体系的に統合した。次に、最先端の敵対的攻撃に対する画像変換の検出パフォーマンスをテストするために、広範な実験を実施します。さらに、個々の変換が敵対的な例を確実に検出できないことを明らかにし、9つの画像変換のスコアを組み合わせたAdvJudgeと呼ばれるDNNベースのアプローチを提案します。どの個々のスコアが誤解を招くか、誤解を招かないかを知らなくても、AdvJudgeは正しい判断を下し、検出率を大幅に向上させることができます。最後に、説明可能なAIツールを使用して、敵対的な検出に対する各画像変換の寄与を示します。実験結果は、敵対的検出への画像変換の寄与が大幅に異なることを示しています。それらの組み合わせにより、最先端の敵対的攻撃に対する一般的な検出能力を大幅に向上させることができます。

Deep neural networks (DNNs) are threatened by adversarial examples. Adversarial detection, which distinguishes adversarial images from benign images, is fundamental for robust DNN-based services. Image transformation is one of the most effective approaches to detect adversarial examples. During the last few years, a variety of image transformations have been studied and discussed to design reliable adversarial detectors. In this paper, we systematically synthesize the recent progress on adversarial detection via image transformations with a novel classification method. Then, we conduct extensive experiments to test the detection performance of image transformations against state-of-the-art adversarial attacks. Furthermore, we reveal that each individual transformation is not capable of detecting adversarial examples in a robust way, and propose a DNN-based approach referred to as AdvJudge, which combines scores of 9 image transformations. Without knowing which individual scores are misleading or not misleading, AdvJudge can make the right judgment, and achieve a significant improvement in detection rate. Finally, we utilize an explainable AI tool to show the contribution of each image transformation to adversarial detection. Experimental results show that the contribution of image transformations to adversarial detection is significantly different, the combination of them can significantly improve the generic detection ability against state-of-the-art adversarial attacks.

updated: Thu May 26 2022 15:21:05 GMT+0000 (UTC)

published: Tue Jan 04 2022 10:58:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト