Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

Pu Zhao; Parikshit Ram; Songtao Lu; Yuguang Yao; Djallel Bouneffouf; Xue Lin; Sijia Liu

画像ソースにとらわれない普遍的な敵対的摂動の生成方法の学習

敵対的摂動は、深層学習モデルの堅牢性を証明するために重要です。 Universal adversarial perturbation (UAP) は複数の画像を同時に攻撃できるため、より統一された脅威モデルを提供し、画像単位の攻撃アルゴリズムを不要にします。ただし、既存の UAP ジェネレーターは、画像が異なる画像ソースから描画される場合 (たとえば、画像解像度が異なる場合) には不十分です。画像ソース全体での真の普遍性に向けて、UAP 生成の新しい見方を、少数ショット学習のカスタマイズされたインスタンスとして捉えます。これは、UAP 生成にバイレベル最適化と最適化学習 (L2O) 手法を活用し、攻撃成功率 (ASR) を向上させます。）。まず、UAP ジェネレーターをメタ学習するための一般的なモデルに依存しないメタ学習 (MAML) フレームワークを検討します。ただし、MAML フレームワークは画像ソース全体に普遍的な攻撃を直接提供しないことがわかり、L2O の別のメタ学習フレームワークと統合する必要があります。結果として得られる UAP ジェネレーターのメタ学習スキームは、(i) Projected Gradient Descent などのベースラインよりも優れたパフォーマンス (50% 高い ASR) を持ち、(ii) 通常の L2O および MAML フレームワークよりも優れたパフォーマンス (37% 速い) を持っています ( (iii) さまざまな被害者モデルと画像データソースの UAP 生成を同時に処理できます。

Adversarial perturbations are critical for certifying the robustness of deep learning models. A universal adversarial perturbation (UAP) can simultaneously attack multiple images, and thus offers a more unified threat model, obviating an image-wise attack algorithm. However, the existing UAP generator is underdeveloped when images are drawn from different image sources (e.g., with different image resolutions). Towards an authentic universality across image sources, we take a novel view of UAP generation as a customized instance of few-shot learning, which leverages bilevel optimization and learning-to-optimize (L2O) techniques for UAP generation with improved attack success rate (ASR). We begin by considering the popular model agnostic meta-learning (MAML) framework to meta-learn a UAP generator. However, we see that the MAML framework does not directly offer the universal attack across image sources, requiring us to integrate it with another meta-learning framework of L2O. The resulting scheme for meta-learning a UAP generator (i) has better performance (50% higher ASR) than baselines such as Projected Gradient Descent, (ii) has better performance (37% faster) than the vanilla L2O and MAML frameworks (when applicable), and (iii) is able to simultaneously handle UAP generation for different victim models and image data sources.

updated: Wed Aug 17 2022 23:00:11 GMT+0000 (UTC)

published: Tue Sep 29 2020 01:23:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト