Do Perceptually Aligned Gradients Imply Adversarial Robustness?

Roy Ganz; Bahjat Kawar; Michael Elad

知覚的に整列した勾配は、敵対的ロバスト性を意味しますか?

敵対的にロバストな分類器には、ロバストでないモデルにはない特性、つまり、知覚的に整列した勾配 (PAG) があります。入力に対する勾配は、人間の知覚とよく一致します。いくつかの研究では、PAG が堅牢なトレーニングの副産物であると特定されていますが、PAG をスタンドアロンの現象と見なしたり、独自の意味を研究したりしたものはありません。この作業では、この特性に焦点を当て、知覚的に整列したグラデーションがロバスト性を意味するかどうかをテストします。この目的のために、分類器のトレーニングでPAGを直接促進するという新しい目的を開発し、そのような勾配を持つモデルが敵対的攻撃に対してより堅牢かどうかを調べます。複数のデータセットとアーキテクチャに関する広範な実験により、勾配が整列したモデルが大幅な堅牢性を示すことが検証され、PAG と堅牢性の間の驚くべき双方向の関係が明らかになりました。最後に、より良い勾配アライメントがロバスト性の向上につながり、この観察結果を利用して既存の敵対的トレーニング手法のロバスト性を高めることを示します。

Adversarially robust classifiers possess a trait that non-robust models do not -- Perceptually Aligned Gradients (PAG). Their gradients with respect to the input align well with human perception. Several works have identified PAG as a byproduct of robust training, but none have considered it as a standalone phenomenon nor studied its own implications. In this work, we focus on this trait and test whether Perceptually Aligned Gradients imply Robustness. To this end, we develop a novel objective to directly promote PAG in training classifiers and examine whether models with such gradients are more robust to adversarial attacks. Extensive experiments on multiple datasets and architectures validate that models with aligned gradients exhibit significant robustness, exposing the surprising bidirectional connection between PAG and robustness. Lastly, we show that better gradient alignment leads to increased robustness and harness this observation to boost the robustness of existing adversarial training techniques.

updated: Wed Feb 01 2023 12:24:57 GMT+0000 (UTC)

published: Fri Jul 22 2022 23:48:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト