Adversarial Image Color Transformations in Explicit Color Filter Space

Zhengyu Zhao; Zhuoran Liu; Martha Larson

明示的なカラーフィルター空間での敵対的な画像の色変換

ディープニューラルネットワークは、敵対的な画像に対して脆弱であることがわかっています。従来の攻撃は、厳密に制限された摂動を伴う、区別できない敵対的な画像を狙っています。最近、研究者たちは、識別可能でありながら不審ではない敵対的な画像の探索に着手し、色変換攻撃が効果的であることを実証しました。この研究では、単純なカラーフィルターのパラメーター空間の勾配情報で最適化された新しい色変換攻撃である Adversarial Color Filter (AdvCF) を提案します。特に、カラーフィルター空間は明示的に指定されているため、攻撃と防御の両方の観点から、敵対的なカラー変換に対するモデルの堅牢性を体系的に分析できます。対照的に、既存の色変換攻撃では、そのような明示的な空間が欠如しているため、体系的な分析の機会が提供されません。さらに、画像分類器をだます際の AdvCF の有効性を実証し、広範なユーザー調査を通じて防御に対する堅牢性と画像の受容性に関して他の色変換攻撃と比較します。また、AdvCF の人間による解釈可能性にも焦点を当て、画像の受容性と効率性の両方において、人間が解釈できる最先端の色変換攻撃よりも AdvCF が優れていることを示します。追加の結果は、別の 3 つの視覚的タスクにおける AdvCF に対するモデルの堅牢性に関する興味深い新しい洞察を提供します。

Deep Neural Networks have been shown to be vulnerable to adversarial images. Conventional attacks strive for indistinguishable adversarial images with strictly restricted perturbations. Recently, researchers have moved to explore distinguishable yet non-suspicious adversarial images and demonstrated that color transformation attacks are effective. In this work, we propose Adversarial Color Filter (AdvCF), a novel color transformation attack that is optimized with gradient information in the parameter space of a simple color filter. In particular, our color filter space is explicitly specified so that we are able to provide a systematic analysis of model robustness against adversarial color transformations, from both the attack and defense perspectives. In contrast, existing color transformation attacks do not offer the opportunity for systematic analysis due to the lack of such an explicit space. We further demonstrate the effectiveness of our AdvCF in fooling image classifiers and also compare it with other color transformation attacks regarding their robustness to defenses and image acceptability through an extensive user study. We also highlight the human-interpretability of AdvCF and show its superiority over the state-of-the-art human-interpretable color transformation attack on both image acceptability and efficiency. Additional results provide interesting new insights into model robustness against AdvCF in another three visual tasks.

updated: Fri Jun 16 2023 10:19:13 GMT+0000 (UTC)

published: Thu Nov 12 2020 23:51:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト