GreedyFool: Multi-Factor Imperceptibility and Its Application to Designing a Black-box Adversarial Attack

Hui Liu; Bo Zhao; Minzhi Ji; Peng Liu

Adversarial examples are carefully crafted input samples whose perturbations are imperceptible to the human eye but easily mislead the output of deep neural networks (DNNs). Existing works synthesize adversarial examples by penalizing perturbations with simple metrics that do not sufficiently account for the human visual system (HVS), and as a result they produce noticeable artifacts. To explore why the perturbations are visible, this paper summarizes four primary factors affecting how the human eye perceives them. Based on this investigation, we design a multi-factor metric, MulFactorLoss, for measuring the perceptual loss between benign examples and adversarial ones. To test the imperceptibility achieved with the multi-factor metric, we propose a novel black-box adversarial attack called GreedyFool. GreedyFool applies differential evolution to evaluate the effects of perturbed pixels on the confidence of a target DNN, and introduces a greedy approximation to automatically generate adversarial perturbations. We conduct extensive experiments on the ImageNet and CIFAR-10 datasets and a comprehensive user study with 60 participants. The experimental results demonstrate that MulFactorLoss captures imperceptibility better than the existing pixelwise metrics, and that GreedyFool achieves a 100% success rate in a black-box manner.
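To make the idea of a greedy, query-only attack concrete, the following is a minimal sketch under strong assumptions, not the GreedyFool algorithm itself. The abstract does not give implementation details, so everything here is hypothetical: `toy_black_box` is a four-pixel logistic model standing in for the target DNN, an exhaustive probe of each pixel replaces differential evolution, and MulFactorLoss is not modeled at all. The sketch only shows the general pattern of scoring pixels by their effect on the model's confidence and greedily committing the most damaging change.

```python
import math


def toy_black_box(image, weights):
    """Hypothetical stand-in for a target DNN: returns P(class 1) only."""
    z = sum(w * x for w, x in zip(weights, image))
    return 1.0 / (1.0 + math.exp(-z))


def greedy_attack(image, query, eps=0.5, max_steps=50):
    """Greedily perturb one pixel per step until the predicted label flips.

    Assumption: each pixel is probed exhaustively with +/- eps. The paper
    uses differential evolution for this evaluation step; that is not
    reproduced here.
    """
    adv = list(image)
    original_label = query(adv) >= 0.5
    changed = set()
    for _ in range(max_steps):
        if (query(adv) >= 0.5) != original_label:
            break  # label flipped: attack succeeded
        best = None  # (confidence in original label, pixel index, new value)
        for i in range(len(adv)):
            if i in changed:
                continue  # perturb each pixel at most once
            for delta in (-eps, eps):
                candidate = adv[:]
                # Keep pixel values inside the valid [0, 1] range.
                candidate[i] = min(1.0, max(0.0, image[i] + delta))
                conf = query(candidate)
                score = conf if original_label else 1.0 - conf
                if best is None or score < best[0]:
                    best = (score, i, candidate[i])
        if best is None:
            break  # every pixel has already been perturbed
        _, index, value = best
        adv[index] = value
        changed.add(index)
    return adv, sorted(changed)


# Hypothetical toy setup: the attacker only sees confidences via `query`.
WEIGHTS = [0.5, -0.25, 1.0, 0.75]
benign = [0.5, 0.5, 0.5, 0.5]
query = lambda img: toy_black_box(img, WEIGHTS)

adversarial, touched = greedy_attack(benign, query)
print("benign confidence:", query(benign))
print("adversarial confidence:", query(adversarial))
print("perturbed pixel indices:", touched)
```

The attacker never reads `WEIGHTS` directly; it only calls `query`, which is what makes the setting black-box. A perceptual metric such as the one the paper proposes would enter as an additional term when ranking candidate perturbations, which this sketch omits.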

updated: Mon Nov 29 2021 03:10:31 GMT+0000 (UTC)

published: Wed Oct 14 2020 07:45:06 GMT+0000 (UTC)

arXiv
