Towards Transferable Unrestricted Adversarial Examples with Minimum Changes

Fangcheng Liu; Chao Zhang; Hongyang Zhang

最小限の変更で譲渡可能な無制限の敵対的例に向けて

転送ベースの敵対的例は、ブラックボックス攻撃の最も重要なクラスの 1 つです。ただし、転送可能性と敵対的摂動の知覚不能性との間にはトレードオフがあります。この方向の以前の作業では、多くの場合、適切な転送成功率に到達するために、固定されているが大きな ℓ_p-ノルム摂動バジェットが必要であり、知覚可能な敵対摂動につながります。一方、意味を保存する摂動を生成することを目的とした現在の無制限の敵対的攻撃のほとんどは、ターゲットモデルへの伝達性が弱いという欠点があります。この作業では、最小限の変更で転送可能な敵対的な例を生成するためのジオメトリ認識フレームワークを提案します。統計的機械学習におけるモデル選択と同様に、検証モデルを活用して、ℓ_∞ ノルムと無制限の脅威モデルの両方で、各画像に最適な摂動バジェットを選択します。グループ外の類似性にペナルティを課しながら、グループ内の多様性を促進することにより、トレーニングモデルと検証モデルの分割のための原則的な方法を提案します。大規模な実験により、細工された敵対的な例の知覚不能性と転送可能性のバランスをとるフレームワークの有効性が検証されます。この方法論は、CVPR'21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet へのエントリの基盤であり、1,559 チーム中 1 位にランク付けされ、次点の提出物を最終的に 4.59% および 23.91% 上回っています。それぞれスコアと平均画質レベル。コードは https://github.com/Equationliu/GA-Attack で入手できます。

Transfer-based adversarial example is one of the most important classes of black-box attacks. However, there is a trade-off between transferability and imperceptibility of the adversarial perturbation. Prior work in this direction often requires a fixed but large ℓ_p-norm perturbation budget to reach a good transfer success rate, leading to perceptible adversarial perturbations. On the other hand, most of the current unrestricted adversarial attacks that aim to generate semantic-preserving perturbations suffer from weaker transferability to the target model. In this work, we propose a geometry-aware framework to generate transferable adversarial examples with minimum changes. Analogous to model selection in statistical machine learning, we leverage a validation model to select the best perturbation budget for each image under both the ℓ_∞-norm and unrestricted threat models. We propose a principled method for the partition of training and validation models by encouraging intra-group diversity while penalizing extra-group similarity. Extensive experiments verify the effectiveness of our framework on balancing imperceptibility and transferability of the crafted adversarial examples. The methodology is the foundation of our entry to the CVPR'21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet, in which we ranked 1st place out of 1,559 teams and surpassed the runner-up submissions by 4.59% and 23.91% in terms of final score and average image quality level, respectively. Code is available at https://github.com/Equationliu/GA-Attack.

updated: Tue Dec 27 2022 12:47:14 GMT+0000 (UTC)

published: Tue Jan 04 2022 12:03:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト