Robustness of classifiers to universal perturbations: a geometric perspective

Seyed-Mohsen Moosavi-Dezfooli; Alhussein Fawzi; Omar Fawzi; Pascal Frossard; Stefano Soatto

普遍的な摂動に対する分類器のロバスト性：幾何学的観点

深いネットワークは、普遍的な摂動に対して脆弱であることが最近示されています。非常に小さな画像にとらわれない摂動が存在し、ほとんどの自然画像がそのような分類器によって誤って分類される原因になります。この論文では、普遍的な摂動に対する分類器のロバスト性の最初の定量分析を提案し、普遍的な摂動に対するロバスト性と決定境界の幾何学との間の形式的なリンクを描きます。具体的には、2つの決定境界モデル（フラットモデルとカーブモデル）の下で分類器のロバスト性に関する理論的限界を確立します。特に、普遍的な摂動に対する深いネットワークのロバスト性は、それらの曲率の重要な特性によって駆動されることを示します。深いネットワークの決定境界が体系的に正に湾曲する共通の方向が存在します。そのような条件下で、我々は小さな普遍的な摂動の存在を証明します。私たちの分析はさらに、それらの特性を説明することに加えて、普遍的な摂動を計算するための新しい幾何学的方法を提供します。

Deep networks have recently been shown to be vulnerable to universal perturbations: there exist very small image-agnostic perturbations that cause most natural images to be misclassified by such classifiers. In this paper, we propose the first quantitative analysis of the robustness of classifiers to universal perturbations, and draw a formal link between the robustness to universal perturbations, and the geometry of the decision boundary. Specifically, we establish theoretical bounds on the robustness of classifiers under two decision boundary models (flat and curved models). We show in particular that the robustness of deep networks to universal perturbations is driven by a key property of their curvature: there exists shared directions along which the decision boundary of deep networks is systematically positively curved. Under such conditions, we prove the existence of small universal perturbations. Our analysis further provides a novel geometric method for computing universal perturbations, in addition to explaining their properties.

updated: Mon Mar 01 2021 20:45:00 GMT+0000 (UTC)

published: Fri May 26 2017 12:42:55 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト