Meta Gradient Adversarial Attack

Zheng Yuan; Jie Zhang; Yunpei Jia; Chuanqi Tan; Tao Xue; Shiguang Shan

Adversarial attacks have become an active research area in recent years. Although transfer-based adversarial attacks have achieved promising results in improving transferability to unseen black-box models, considerable room for improvement remains. Inspired by meta-learning, this paper proposes a novel architecture called Meta Gradient Adversarial Attack (MGAA), which is plug-and-play and can be integrated with any existing gradient-based attack method to improve cross-model transferability. Specifically, we randomly sample multiple models from a model zoo to compose different tasks, and in each task we iteratively simulate a white-box attack and a black-box attack. Narrowing the gap between the gradient directions of the white-box and black-box attacks improves the transferability of adversarial examples in the black-box setting. Extensive experiments on the CIFAR10 and ImageNet datasets show that our architecture outperforms state-of-the-art methods in both black-box and white-box attack settings.
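The task loop described in the abstract can be illustrated with a minimal sketch. The abstract gives no update rule, step sizes, or model details, so everything concrete below is an assumption made for illustration: the "models" are toy linear classifiers with a logistic loss, both simulated attacks use sign-gradient steps, and the example is updated by adding only the displacement produced by the simulated black-box step. The names `mgaa_sketch`, `loss_grad`, `alpha`, `beta`, `n_train`, and `k_steps` are hypothetical and do not come from the paper.

```python
import math
import random


def loss_grad(w, x, y):
    """Gradient w.r.t. x of the logistic loss log(1 + exp(-y * w.x)) (toy linear model)."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    scale = -y / (1.0 + math.exp(margin))  # derivative of the loss w.r.t. w.x
    return [scale * wi for wi in w]


def sign(v):
    return (v > 0) - (v < 0)


def clip(x_adv, x_clean, eps):
    """Project x_adv back into the L-infinity ball of radius eps around x_clean."""
    return [min(max(a, c - eps), c + eps) for a, c in zip(x_adv, x_clean)]


def mgaa_sketch(zoo, x, y, eps=0.5, tasks=20, n_train=2, k_steps=3,
                alpha=0.05, beta=0.05, seed=0):
    rng = random.Random(seed)
    x_adv = list(x)
    for _ in range(tasks):
        # One task: n_train models play the white-box role, one held-out
        # model plays the black-box role.
        *train, test = rng.sample(zoo, n_train + 1)

        # Simulated white-box attack: sign-gradient steps on the averaged
        # gradient of the sampled models.
        x_wb = list(x_adv)
        for _ in range(k_steps):
            grads = [loss_grad(w, x_wb, y) for w in train]
            avg = [sum(g) / len(train) for g in zip(*grads)]
            x_wb = clip([xi + alpha * sign(gi) for xi, gi in zip(x_wb, avg)], x, eps)

        # Simulated black-box attack: one step on the held-out model,
        # starting from the white-box result.
        g_test = loss_grad(test, x_wb, y)
        x_bb = clip([xi + beta * sign(gi) for xi, gi in zip(x_wb, g_test)], x, eps)

        # Assumed update rule: keep only the displacement contributed by
        # the simulated black-box step.
        x_adv = clip([a + (b - c) for a, b, c in zip(x_adv, x_bb, x_wb)], x, eps)
    return x_adv


# Hypothetical toy zoo of four linear models over three input features.
zoo = [[1.0, 0.5, -0.2], [0.8, 0.7, -0.1], [1.2, 0.3, -0.4], [0.9, 0.6, -0.3]]
x, y = [1.0, 1.0, 1.0], 1
x_adv = mgaa_sketch(zoo, x, y)
```

With real networks, `loss_grad` would be replaced by a framework's automatic differentiation, and the sign-gradient step could be swapped for any other gradient-based attack step, in line with the plug-and-play claim.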

updated: Mon Aug 09 2021 17:44:19 GMT+0000 (UTC)

published: Mon Aug 09 2021 17:44:19 GMT+0000 (UTC)

arXiv
