Staircase Sign Method for Boosting Adversarial Attacks

Qilong Zhang; Xiaosu Zhu; Jingkuan Song; Lianli Gao; Heng Tao Shen

Crafting adversarial examples for transfer-based attacks is challenging and remains a research hot spot. Current attack methods rest on the hypothesis that the substitute model and the victim model learn similar decision boundaries, and they conventionally apply the Sign Method (SM) to turn the gradient into the resultant perturbation. Although SM is efficient, it extracts only the sign of each gradient unit and ignores the differences in their values, which inevitably leads to a deviation. We therefore propose a novel Staircase Sign Method (S^2M) to alleviate this issue and thus boost attacks. Technically, our method heuristically divides the gradient sign into several segments according to the values of the gradient units, and then assigns each segment a staircase weight to better craft the adversarial perturbation. As a result, our adversarial examples perform better in both white-box and black-box settings without being more visible. Since S^2M manipulates only the resultant gradient, it can be integrated into the family of FGSM algorithms in general, with negligible computational overhead. Extensive experiments on the ImageNet dataset demonstrate the effectiveness of the proposed method, which significantly improves transferability (on average, by 5.1% for normally trained models and 12.8% for adversarially trained defenses). Our code is available at https://github.com/qilong-zhang/Staircase-sign-method.
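
The contrast between SM and a staircase-weighted sign can be illustrated with a minimal NumPy sketch. It is not taken from the authors' repository. The function names, the default of four segments, the equal-population percentile split, and the weights (2k - 1)/K are all assumptions chosen for illustration; the abstract states only that the gradient sign is divided into segments by gradient value and that each segment receives a staircase weight.

```python
import numpy as np


def sign_method(grad):
    """Plain Sign Method: keep only the sign of each gradient unit."""
    return np.sign(grad)


def staircase_sign(grad, num_segments=4):
    """Illustrative staircase sign (assumed design, not the authors' code).

    Gradient units are ranked by magnitude, split into `num_segments`
    equal-population segments via percentiles, and each segment gets a
    weight that rises like a staircase: 1/K, 3/K, ..., (2K-1)/K.
    These weights average to 1, which keeps the mean magnitude equal to
    that of the plain sign when segments are evenly populated.
    """
    grad = np.asarray(grad, dtype=float)
    mag = np.abs(grad).ravel()
    # Interior percentile edges, e.g. the 25th, 50th, 75th for K = 4.
    qs = np.linspace(0.0, 100.0, num_segments + 1)[1:-1]
    edges = np.percentile(mag, qs)
    # Segment index in 0..K-1 for every gradient unit.
    seg = np.searchsorted(edges, mag, side="left")
    weights = (2 * np.arange(num_segments) + 1) / num_segments
    return np.sign(grad) * weights[seg].reshape(grad.shape)


def fgsm_step(x, grad, eps, direction=staircase_sign):
    """One FGSM-style update on an image in [0, 1] (hypothetical helper)."""
    return np.clip(x + eps * direction(grad), 0.0, 1.0)
```

Because only the direction function changes, `fgsm_step(x, grad, eps, direction=sign_method)` recovers the ordinary sign update, which mirrors the abstract's point that the staircase step can slot into FGSM-family attacks with little extra cost.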

updated: Tue Apr 12 2022 08:16:12 GMT+0000 (UTC)

published: Tue Apr 20 2021 02:31:55 GMT+0000 (UTC)

arXiv
