On Success and Simplicity: A Second Look at Transferable Targeted Attacks

Zhengyu Zhao; Zhuoran Liu; Martha Larson

成功とシンプルさについて：転送可能な標的型攻撃の再検討

標的型攻撃の転送可能性を実現することは非常に難しいと言われています。現在、最先端のアプローチでは、追加のデータを使用して各ターゲットクラスのトレーニングモデルが必要になるため、リソースを大量に消費します。しかし、私たちの調査では、追加のデータもモデルのトレーニングも必要としない単純な転送可能な攻撃は、驚くほど高いターゲット転送可能性を達成できることがわかりました。この洞察は、主に攻撃の最適化を限られた回数の反復に不当に制限するという広範な慣行のために、これまで見過ごされてきました。特に、私たちは初めて、単純なロジットの損失が最先端技術との競争力のある結果を生み出すことができることを確認しました。私たちの分析は、特に3つの新しい現実的な設定を含む、さまざまな転送設定にまたがっています。モデルの類似性がほとんどないアンサンブル転送設定、ランクの低いターゲットクラスを使用した最悪の設定、Google CloudVisionに対する実際の攻撃です。 API。これらの新しい設定の結果は、一般的に採用されている簡単な設定では、さまざまな攻撃の実際の特性を完全に明らかにすることはできず、誤解を招く比較を引き起こす可能性があることを示しています。また、データやトレーニングを必要としない方法で、ターゲットを絞った普遍的な敵対的摂動を生成するための単純なロジット損失の有用性も示します。全体として、私たちの分析の目的は、対象となる移転可能性についてより意味のある評価を促すことです。コードはhttps://github.com/ZhengyuZhao/Targeted-Tansferで入手できます。

Achieving transferability of targeted attacks is reputed to be remarkably difficult. Currently, state-of-the-art approaches are resource-intensive because they necessitate training model(s) for each target class with additional data. In our investigation, we find, however, that simple transferable attacks which require neither additional data nor model training can achieve surprisingly high targeted transferability. This insight has been overlooked until now, mainly due to the widespread practice of unreasonably restricting attack optimization to a limited number of iterations. In particular, we, for the first time, identify that a simple logit loss can yield competitive results with the state of the arts. Our analysis spans a variety of transfer settings, especially including three new, realistic settings: an ensemble transfer setting with little model similarity, a worse-case setting with low-ranked target classes, and also a real-world attack against the Google Cloud Vision API. Results in these new settings demonstrate that the commonly adopted, easy settings cannot fully reveal the actual properties of different attacks and may cause misleading comparisons. We also show the usefulness of the simple logit loss for generating targeted universal adversarial perturbations in a data-free and training-free manner. Overall, the aim of our analysis is to inspire a more meaningful evaluation on targeted transferability. Code is available at https://github.com/ZhengyuZhao/Targeted-Tansfer

updated: Tue Oct 26 2021 20:12:06 GMT+0000 (UTC)

published: Mon Dec 21 2020 09:41:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト