Detecting Adversarial Examples by Input Transformations, Defense Perturbations, and Voting

Federico Nesti; Alessandro Biondi; Giorgio Buttazzo

入力変換、防御摂動、および投票による敵対的な例の検出

過去数年にわたって、畳み込みニューラルネットワーク（CNN）は、視覚認識タスクで超人的なパフォーマンスに到達することが証明されています。ただし、CNNは、敵対的な例、つまり、ネットワークに誤った出力を予測させる一方で、正しい出力が予測される画像と非常に類似している、悪意を持って作成された画像に簡単にだまされる可能性があります。通常の敵対的な例は、入力画像の変換に対してロバストではありません。入力画像の変換は、敵対的な例がネットワークに提示されているかどうかを検出するために使用できます。それでもなお、そのような変換に対してロバストな敵対的例を生成することは可能です。この論文は、画像変換を介した敵対的な例の検出を広範囲に調査し、敵対的な例がロバストであるのと同じ入力変換でロバストな敵対的な例を検出するための防御摂動と呼ばれる新しい方法論を提案します。このような防御の混乱は、強力な敵対的な例に対する効果的な対抗策であることが示されています。さらに、マルチネットワークの敵対的な例が紹介されています。この種の敵対的な例は、複数のネットワークを同時にだますために使用できます。これは、複数のCNNに多数決するアーキテクチャに基づくシステムなど、ネットワークの冗長性を使用するシステムでは重要です。 Imagenetデータセットでトレーニングされた最先端のCNNに基づく広範な実験セットが最終的に報告されます。

Over the last few years, convolutional neural networks (CNNs) have proved to reach super-human performance in visual recognition tasks. However, CNNs can easily be fooled by adversarial examples, i.e., maliciously-crafted images that force the networks to predict an incorrect output while being extremely similar to those for which a correct output is predicted. Regular adversarial examples are not robust to input image transformations, which can then be used to detect whether an adversarial example is presented to the network. Nevertheless, it is still possible to generate adversarial examples that are robust to such transformations. This paper extensively explores the detection of adversarial examples via image transformations and proposes a novel methodology, called defense perturbation, to detect robust adversarial examples with the same input transformations the adversarial examples are robust to. Such a defense perturbation is shown to be an effective counter-measure to robust adversarial examples. Furthermore, multi-network adversarial examples are introduced. This kind of adversarial examples can be used to simultaneously fool multiple networks, which is critical in systems that use network redundancy, such as those based on architectures with majority voting over multiple CNNs. An extensive set of experiments based on state-of-the-art CNNs trained on the Imagenet dataset is finally reported.

updated: Mon Aug 16 2021 10:40:50 GMT+0000 (UTC)

published: Wed Jan 27 2021 14:50:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト