Silent Killer: Optimizing Backdoor Trigger Yields a Stealthy and Powerful Data Poisoning Attack

Tzvi Lederer; Gallil Maimon; Lior Rokach

We propose a stealthy and powerful backdoor attack on neural networks based on data poisoning (DP). In contrast to previous attacks, both the poison and the trigger in our method are stealthy. We are able to change the model's classification of samples from a source class to a target class chosen by the attacker. We do so by using a small number of poisoned training samples with nearly imperceptible perturbations, without changing their labels. At inference time, we use a stealthy perturbation added to the attacked samples as a trigger. This perturbation is crafted as a universal adversarial perturbation (UAP), and the poison is crafted using gradient alignment coupled to this trigger. Our method is highly efficient in crafting time compared to previous methods and requires only a trained surrogate model without additional retraining. Our attack achieves state-of-the-art results in terms of attack success rate while maintaining high accuracy on clean samples.
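The gradient-alignment idea mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: it assumes a tiny logistic-regression surrogate in place of a neural network, assumes the poisons are built from correctly labeled base samples, and uses hypothetical names (`logistic_grad`, `alignment_loss`, `poison_objective`). It only shows the quantity an attacker would minimize: one minus the cosine similarity between the training gradient of the poisoned samples and the gradient that would push triggered source samples toward the target class.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def logistic_grad(w, xs, ys):
    """Mean gradient of binary cross-entropy w.r.t. the weights w."""
    grad = [0.0] * len(w)
    for x, y in zip(xs, ys):
        p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
        for i, xi in enumerate(x):
            grad[i] += (p - y) * xi / len(xs)
    return grad

def alignment_loss(g_a, g_b, eps=1e-12):
    """1 - cosine similarity: 0 when aligned, 2 when opposite."""
    dot = sum(a * b for a, b in zip(g_a, g_b))
    norm_a = math.sqrt(sum(a * a for a in g_a))
    norm_b = math.sqrt(sum(b * b for b in g_b))
    return 1.0 - dot / (norm_a * norm_b + eps)

def poison_objective(w, base_xs, base_ys, deltas, source_xs, trigger, target_label):
    """Alignment loss between the poison gradient and the attack gradient.

    Assumption: `trigger` is a fixed universal perturbation added to every
    source sample, and `deltas` are the per-sample poison perturbations
    (kept small by the attacker; the norm bound is omitted here).
    """
    # Attack gradient: source samples + trigger, labeled as the target class.
    triggered = [[xi + ti for xi, ti in zip(x, trigger)] for x in source_xs]
    g_attack = logistic_grad(w, triggered, [target_label] * len(triggered))
    # Poison gradient: perturbed base samples with their labels left unchanged.
    poisoned = [[xi + di for xi, di in zip(x, d)] for x, d in zip(base_xs, deltas)]
    g_poison = logistic_grad(w, poisoned, base_ys)
    return alignment_loss(g_poison, g_attack)
```

In an actual attack the `deltas` would be optimized to lower this loss on the trained surrogate model, typically with automatic differentiation rather than the hand-written gradient used here.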

updated: Thu Jan 05 2023 15:11:05 GMT+0000 (UTC)

published: Thu Jan 05 2023 15:11:05 GMT+0000 (UTC)

arXiv
