Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization

Seungyong Moon; Gaon An; Hyun Oh Song

Solving for adversarial examples with projected gradient descent has been demonstrated to be highly effective in fooling neural-network-based classifiers. In the black-box setting, however, the attacker is limited to query access to the network, and solving for a successful adversarial example becomes much more difficult. To this end, recent methods aim to estimate the true gradient signal from input queries, but at the cost of excessive queries. We propose an efficient discrete surrogate to the optimization problem that does not require estimating the gradient and consequently has no first-order update hyperparameters to tune. Our experiments on CIFAR-10 and ImageNet show state-of-the-art black-box attack performance with a significant reduction in the required queries compared to a number of recently proposed methods. The source code is available at https://github.com/snu-mllab/parsimonious-blackbox-attack.
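To make the idea of a gradient-free discrete surrogate concrete, the following is a minimal sketch and not the authors' implementation (see the linked repository for that). It assumes the perturbation is restricted to the corners of the L-infinity ball, so each coordinate is either -eps or +eps, and it uses a plain greedy coordinate-flip search. The function `greedy_sign_attack`, the toy `loss_query`, and the hidden weights are all hypothetical names introduced for illustration.

```python
import random


def greedy_sign_attack(loss_query, x, eps, max_queries=1000, seed=0):
    """Greedily maximize loss_query(x + delta) over delta in {-eps, +eps}^d.

    Only function values are queried; no gradient is estimated, so there is
    no step size or momentum to tune. (Illustrative sketch, hypothetical API.)
    """
    rng = random.Random(seed)
    d = len(x)
    signs = [rng.choice((-1, 1)) for _ in range(d)]

    def perturbed(s):
        return [xi + eps * si for xi, si in zip(x, s)]

    best = loss_query(perturbed(signs))
    queries = 1
    improved = True
    while improved and queries < max_queries:
        improved = False
        for i in range(d):
            if queries >= max_queries:
                break
            signs[i] = -signs[i]          # tentatively flip one coordinate
            value = loss_query(perturbed(signs))
            queries += 1
            if value > best:
                best = value              # keep the flip
                improved = True
            else:
                signs[i] = -signs[i]      # revert the flip
    return perturbed(signs), best, queries


# Toy stand-in for a black-box model: a linear "loss" with hidden weights.
hidden_w = [0.5, -1.0, 2.0, -0.25]


def loss_query(z):
    return sum(w * zi for w, zi in zip(hidden_w, z))


x = [0.0, 0.0, 0.0, 0.0]
eps = 0.1
x_adv, best, queries = greedy_sign_attack(loss_query, x, eps)
```

On this toy linear loss the search settles on the sign pattern of the hidden weights. A real attack would query a classifier's loss instead, and would need a more query-efficient search than flipping one coordinate at a time.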

updated: Tue Oct 18 2022 15:48:04 GMT+0000 (UTC)

published: Thu May 16 2019 10:14:20 GMT+0000 (UTC)

arXiv
