Amicable Aid: Turning Adversarial Attack to Benefit Classification

Juyeop Kim; Jun-Ho Choi; Soobeom Jang; Jong-Seok Lee

友好的な援助：敵対的攻撃を利益分類に変える

深い画像分類モデルに対する敵対的攻撃は実際には深刻なセキュリティ上の懸念を引き起こしますが、このペーパーでは、敵対的攻撃の概念が分類パフォーマンスに役立つ可能性がある新しいパラダイムを提案します。これを友好的な支援と呼びます。摂動の逆の探索方向を取ることにより、画像を別の画像に変換でき、分類モデルによってより高い信頼性が得られ、誤って分類された画像でも正しく分類できることを示します。さらに、摂動が大きいと、画像はモデルによって正しく認識されますが、人間の目では認識できなくなる可能性があります。友好的な援助のメカニズムは、基礎となる自然画像多様体の観点から説明されています。また、普遍的な友好的な摂動を考慮します。つまり、固定摂動を複数の画像に適用して、分類結果を改善することができます。このような摂動を見つけることは困難ですが、修正されたデータを使用したトレーニングによって画像多様体に対して可能な限り垂直に決定境界を作成することは、普遍的な友好的な摂動をより簡単に見つけることができるモデルを取得するのに効果的であることを示します。最後に、安全な画像通信、プライバシーを保護する画像通信、敵対的な攻撃からの保護など、友好的な支援が役立つ可能性のあるいくつかのアプリケーションシナリオについて説明します。

While adversarial attacks on deep image classification models pose serious security concerns in practice, this paper suggests a novel paradigm where the concept of adversarial attacks can benefit classification performance, which we call amicable aid. We show that by taking the opposite search direction of perturbation, an image can be converted to another yielding higher confidence by the classification model and even a wrongly classified image can be made to be correctly classified. Furthermore, with a large amount of perturbation, an image can be made unrecognizable by human eyes, while it is correctly recognized by the model. The mechanism of the amicable aid is explained in the viewpoint of the underlying natural image manifold. We also consider universal amicable perturbations, i.e., a fixed perturbation can be applied to multiple images to improve their classification results. While it is challenging to find such perturbations, we show that making the decision boundary as perpendicular to the image manifold as possible via training with modified data is effective to obtain a model for which universal amicable perturbations are more easily found. Finally, we discuss several application scenarios where the amicable aid can be useful, including secure image communication, privacy-preserving image communication, and protection against adversarial attacks.

updated: Thu Dec 09 2021 06:16:08 GMT+0000 (UTC)

published: Thu Dec 09 2021 06:16:08 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト