Improving Transformation-based Defenses against Adversarial Examples with First-order Perturbations

Haimin Zhang; Min Xu

一次摂動を伴う敵対的な例に対する変換ベースの防御の改善

ディープニューラルネットワークは、さまざまな機械学習タスクにうまく適用されています。ただし、研究によると、ニューラルネットワークは敵対的な攻撃を受けやすいことが示されています。これは、ニューラルネットワークベースのインテリジェントシステムに潜在的な脅威をさらします。予測されていないクラスラベルに対して生成された小さな一次摂動を敵対的な例に適用することにより、ニューラルネットワークによって出力される正しい結果の確率が増加することを観察します。この観察に基づいて、敵対的ロバスト性を改善するために敵対的摂動を打ち消す方法を提案します。提案された方法では、いくつかのクラスラベルをランダムに選択し、これらの選択されたラベルに対して小さな1次摂動を生成します。生成された摂動は一緒に追加され、指定されたスペースにクランプされます。得られた摂動は、最終的に敵対的な例に追加され、例に含まれる敵対的な摂動を打ち消す。提案された方法は推論時に適用され、モデルの再トレーニングや微調整を必要としません。 CIFAR-10およびCIFAR-100で提案された方法を実験的に検証します。結果は、特により多くの反復を使用して生成された強力な敵対的な例に対して、私たちの方法がいくつかの変換ベースの防御方法の防御パフォーマンスを効果的に改善することを示しています。

Deep neural networks have been successfully applied in various machine learning tasks. However, studies show that neural networks are susceptible to adversarial attacks. This exposes a potential threat to neural network-based intelligent systems. We observe that the probability of the correct result outputted by the neural network increases by applying small first-order perturbations generated for non-predicted class labels to adversarial examples. Based on this observation, we propose a method for counteracting adversarial perturbations to improve adversarial robustness. In the proposed method, we randomly select a number of class labels and generate small first-order perturbations for these selected labels. The generated perturbations are added together and then clamped onto a specified space. The obtained perturbation is finally added to the adversarial example to counteract the adversarial perturbation contained in the example. The proposed method is applied at inference time and does not require retraining or finetuning the model. We experimentally validate the proposed method on CIFAR-10 and CIFAR-100. The results demonstrate that our method effectively improves the defense performance of several transformation-based defense methods, especially against strong adversarial examples generated using more iterations.

updated: Mon May 10 2021 06:46:56 GMT+0000 (UTC)

published: Mon Mar 08 2021 06:27:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト