Admix: Enhancing the Transferability of Adversarial Attacks

Xiaosen Wang; Xuanran He; Jingdong Wang; Kun He

Admix：敵対的攻撃の転送可能性の強化

ディープニューラルネットワークは、ホワイトボックス設定での敵対的な例に対して非常に脆弱であることが知られています。さらに、サロゲート（ソース）モデルで作成された悪意のある攻撃者は、同じ学習タスクを持つがアーキテクチャが異なる他のモデルでブラックボックス転送可能性を示すことがよくあります。最近、敵対的な転送可能性を高めるためにさまざまな方法が提案されていますが、その中で入力変換は最も効果的なアプローチの1つです。この方向で調査し、既存の変換がすべて単一の画像に適用されていることを確認します。これにより、敵対的な転送可能性が制限される可能性があります。この目的のために、入力画像と他のカテゴリからランダムにサンプリングされた画像のセットを考慮するAdmixと呼ばれる新しい入力変換ベースの攻撃方法を提案します。 Admixは、元の入力の勾配を直接計算する代わりに、入力の元のラベルを使用してより転送可能な敵を作成しながら、各アドイン画像のごく一部と混合された入力画像の勾配を計算します。標準のImageNetデータセットでの経験的評価は、Admixが、単一モデル設定とアンサンブルモデル設定の両方で、既存の入力変換方法よりも大幅に優れた転送可能性を実現できることを示しています。既存の入力変換を組み込むことにより、私たちの方法は、転送可能性をさらに改善し、アンサンブルモデル設定で9つの高度な防御モデルを攻撃するときに、入力変換の最先端の組み合わせを明確なマージンで上回ります。コードはhttps://github.com/JHL-HUST/Admixで入手できます。

Deep neural networks are known to be extremely vulnerable to adversarial examples under white-box setting. Moreover, the malicious adversaries crafted on the surrogate (source) model often exhibit black-box transferability on other models with the same learning task but having different architectures. Recently, various methods are proposed to boost the adversarial transferability, among which the input transformation is one of the most effective approaches. We investigate in this direction and observe that existing transformations are all applied on a single image, which might limit the adversarial transferability. To this end, we propose a new input transformation based attack method called Admix that considers the input image and a set of images randomly sampled from other categories. Instead of directly calculating the gradient on the original input, Admix calculates the gradient on the input image admixed with a small portion of each add-in image while using the original label of the input to craft more transferable adversaries. Empirical evaluations on standard ImageNet dataset demonstrate that Admix could achieve significantly better transferability than existing input transformation methods under both single model setting and ensemble-model setting. By incorporating with existing input transformations, our method could further improve the transferability and outperforms the state-of-the-art combination of input transformations by a clear margin when attacking nine advanced defense models under ensemble-model setting. Code is available at https://github.com/JHL-HUST/Admix.

updated: Wed Aug 18 2021 06:29:57 GMT+0000 (UTC)

published: Sun Jan 31 2021 11:40:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト