Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack

Ye Liu; Yaya Cheng; Lianli Gao; Xianglong Liu; Qilong Zhang; Jingkuan Song

アダプティブオートアタックによる敵対的ロバスト性の実用的評価

敵対的攻撃に対する防御モデルは大幅に成長しましたが、実用的な評価方法の欠如が進歩を妨げています。評価は、予算の反復回数とテストデータセットが与えられた場合に、防御モデルの堅牢性の下限を探すこととして定義できます。実用的な評価方法は、便利（つまり、パラメーターなし）、効率的（つまり、反復回数が少ない）、信頼性（つまり、ロバスト性の下限に近づく）である必要があります。この目標に向けて、テスト時間トレーニング方式で効率と信頼性に対処する、パラメータのない適応自動攻撃（A ^ 3）評価方法を提案します。具体的には、特定の防御モデルに対する敵対的な例が開始点でいくつかの規則に従うことを観察することにより、評価を高速化するための適応方向初期化戦略を設計します。さらに、予算の反復回数の下でロバスト性の下限に近づくために、攻撃しにくい画像を自動的に識別して破棄するオンライン統計ベースの破棄戦略を提案します。広範な実験により、A^3の有効性が実証されています。特に、広く使用されている50近くの防御モデルにA^3を適用します。既存の方法よりもはるかに少ない反復、つまり平均1/10（10倍の速度向上）を消費することにより、すべての場合でより低いロバスト精度を達成します。特に、この方法を使用したCVPR2021ホワイトボックスの防衛モデルに対する敵対的攻撃の競争で1681チームのうち1位を獲得しました。コードは次の場所で入手できます：https：//github.com/liuye6666/adaptive_auto_attackhttps：//github.com/liuye6666/adaptive \ _auto \ _attack

Defense models against adversarial attacks have grown significantly, but the lack of practical evaluation methods has hindered progress. Evaluation can be defined as looking for defense models' lower bound of robustness given a budget number of iterations and a test dataset. A practical evaluation method should be convenient (i.e., parameter-free), efficient (i.e., fewer iterations) and reliable (i.e., approaching the lower bound of robustness). Towards this target, we propose a parameter-free Adaptive Auto Attack (A^3) evaluation method which addresses the efficiency and reliability in a test-time-training fashion. Specifically, by observing that adversarial examples to a specific defense model follow some regularities in their starting points, we design an Adaptive Direction Initialization strategy to speed up the evaluation. Furthermore, to approach the lower bound of robustness under the budget number of iterations, we propose an online statistics-based discarding strategy that automatically identifies and abandons hard-to-attack images. Extensive experiments demonstrate the effectiveness of our A^3. Particularly, we apply A^3 to nearly 50 widely-used defense models. By consuming much fewer iterations than existing methods, i.e., 1/10 on average (10× speed up), we achieve lower robust accuracy in all cases. Notably, we won first place out of 1681 teams in CVPR 2021 White-box Adversarial Attacks on Defense Models competitions with this method. Code is available at: https://github.com/liuye6666/adaptive_auto_attackhttps://github.com/liuye6666/adaptive\_auto\_attack

updated: Thu Mar 10 2022 04:53:54 GMT+0000 (UTC)

published: Thu Mar 10 2022 04:53:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト