ALA: Adversarial Lightness Attack via Naturalness-aware Regularizations

Liangru Sun; Felix Juefei-Xu; Yihao Huang; Qing Guo; Jiayi Zhu; Jincao Feng; Yang Liu; Geguang Pu

ALA：自然性を意識した正則化による敵対的な明度攻撃

ほとんどの研究者は、特殊な敵対的例を使用してDNNの脆弱性を明らかにし、修復することによって、ディープニューラルネットワーク（DNN）の堅牢性を強化しようとしています。攻撃の例の一部には、Lpノルムによって制限された知覚できない摂動があります。ただし、それらの高周波特性のために、敵対的な例は通常、転送性が低く、ノイズ除去方法によって防御することができます。欠陥を回避するために、いくつかの作業は摂動を無制限にして、より良いロバスト性と転送可能性を獲得します。ただし、これらの例は通常不自然に見え、警備員に警告します。高画質で転送性の良い無制限の敵対的例を生成するために、本論文では、画像の明るさの変更に焦点を当てたホワイトボックスの無制限の敵対的攻撃である敵対的明度攻撃（ALA）を提案します。人間の知覚に不可欠なサンプルの形状と色は、ほとんど影響を受けません。高画質の敵対的な例を取得するために、自然性を意識した正則化を作成します。より強力な転送可能性を実現するために、攻撃手順でランダム初期化とノンストップ攻撃戦略を提案します。異なるタスクの2つの一般的なデータセット（つまり、画像分類用のImageNetとシーン認識用のPlaces-365）でALAの有効性を検証します。実験は、生成された敵対的な例が強力な転送可能性と高い画質の両方を持っていることを示しています。さらに、敵対的な例は、明度の破損を防御するための標準的なトレーニング済みResNet50の改善にも役立ちます。

Most researchers have tried to enhance the robustness of deep neural networks (DNNs) by revealing and repairing the vulnerability of DNNs with specialized adversarial examples. Parts of the attack examples have imperceptible perturbations restricted by Lp norm. However, due to their high-frequency property, the adversarial examples usually have poor transferability and can be defensed by denoising methods. To avoid the defects, some works make the perturbations unrestricted to gain better robustness and transferability. However, these examples usually look unnatural and alert the guards. To generate unrestricted adversarial examples with high image quality and good transferability, in this paper, we propose Adversarial Lightness Attack (ALA), a white-box unrestricted adversarial attack that focuses on modifying the lightness of the images. The shape and color of the samples, which are crucial to human perception, are barely influenced. To obtain adversarial examples with high image quality, we craft a naturalness-aware regularization. To achieve stronger transferability, we propose random initialization and non-stop attack strategy in the attack procedure. We verify the effectiveness of ALA on two popular datasets for different tasks (i.e., ImageNet for image classification and Places-365 for scene recognition). The experiments show that the generated adversarial examples have both strong transferability and high image quality. Besides, the adversarial examples can also help to improve the standard trained ResNet50 on defending lightness corruption.

updated: Sun Jan 16 2022 15:25:24 GMT+0000 (UTC)

published: Sun Jan 16 2022 15:25:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト