Sparse-RS: a versatile framework for query-efficient sparse black-box adversarial attacks

Francesco Croce; Maksym Andriushchenko; Naman D. Singh; Nicolas Flammarion; Matthias Hein

We propose a versatile framework based on random search, Sparse-RS, for score-based sparse targeted and untargeted attacks in the black-box setting. Sparse-RS does not rely on substitute models and achieves state-of-the-art success rate and query efficiency for multiple sparse attack models: l_0-bounded perturbations, adversarial patches, and adversarial frames. The l_0-version of untargeted Sparse-RS outperforms all black-box and even all white-box attacks for different models on MNIST, CIFAR-10, and ImageNet. Moreover, our untargeted Sparse-RS achieves very high success rates even for the challenging settings of 20×20 adversarial patches and 2-pixel wide adversarial frames for 224×224 images. Finally, we show that Sparse-RS can be applied to generate targeted universal adversarial patches where it significantly outperforms the existing approaches. The code of our framework is available at https://github.com/fra31/sparse-rs.
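The abstract describes a score-based random search under an l_0 budget. A minimal sketch of that general idea follows. It is not the authors' implementation (see the linked repository for that). It assumes a toy linear "model" whose only output is a margin score, and it uses a simple proposal rule that moves one perturbed position per step. The names `sparse_random_search`, `toy_margin`, `WEIGHTS`, and `BIAS` are hypothetical and chosen only for illustration.

```python
import random


def sparse_random_search(x, loss_fn, k, n_queries, seed=0):
    """Toy l_0-bounded random search (illustrative sketch only).

    Changes at most k entries of x (values assumed in [0, 1]) and only
    queries loss_fn, which plays the role of the black-box score.
    A negative loss is treated as a successful attack.
    """
    d = len(x)
    if not 0 < k < d:
        raise ValueError("k must satisfy 0 < k < len(x)")
    rng = random.Random(seed)

    # Start from k random positions set to extreme values 0.0 or 1.0.
    values = {i: float(rng.randint(0, 1)) for i in rng.sample(range(d), k)}

    def apply(vals):
        x_adv = list(x)
        for i, v in vals.items():
            x_adv[i] = v
        return x_adv

    best_loss = loss_fn(apply(values))
    queries = 1
    while queries < n_queries and best_loss >= 0:
        # Proposal: move one perturbed position elsewhere, resample its value.
        candidate = dict(values)
        out_idx = rng.choice(list(candidate))
        in_idx = rng.choice([i for i in range(d) if i not in candidate])
        del candidate[out_idx]
        candidate[in_idx] = float(rng.randint(0, 1))

        loss = loss_fn(apply(candidate))
        queries += 1
        # Greedy acceptance: keep the proposal only if the score improves.
        if loss < best_loss:
            values, best_loss = candidate, loss
    return apply(values), best_loss, queries


# Hypothetical black box: a linear margin, positive when correctly classified.
WEIGHTS = [(-1.0) ** i * (1 + i % 5) for i in range(50)]
BIAS = 1.0


def toy_margin(x_in):
    return sum(w * v for w, v in zip(WEIGHTS, x_in)) + BIAS


x_clean = [0.5] * 50
x_adv, final_loss, queries = sparse_random_search(
    x_clean, toy_margin, k=3, n_queries=1000
)
print("clean margin:", toy_margin(x_clean))
print("adversarial margin:", final_loss, "after", queries, "queries")
```

The sketch keeps the two ingredients named in the abstract: only model scores are queried (no gradients or substitute model), and the perturbation stays sparse because the set of modified positions never grows beyond `k`.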

updated: Tue Feb 08 2022 00:32:43 GMT+0000 (UTC)

published: Tue Jun 23 2020 08:50:37 GMT+0000 (UTC)

arXiv
