Scaleable input gradient regularization for adversarial robustness

Chris Finlay; Adam M Oberman

敵対的堅牢性のためのスケーラブルな入力勾配正則化

この作業では、いくつかの新しい成分を使用して、敵対者の堅牢性のために勾配正則化を再検討します。最初に、局所的な勾配情報に基づいて、新しい画像ごとの理論的な堅牢性の範囲を導き出します。これらの境界は、入力勾配の正則化を強く動機付けます。次に、二重逆伝搬を回避する入力勾配正則化のスケーラブルバージョンを実装します。敵対的に堅牢なImageNetモデルは、4つのコンシューマグレードGPUで33時間でトレーニングされます。最後に、入力勾配の正則化が敵のトレーニングと競合することを実験的および理論的な証明を通じて示します。さらに、勾配の正則化は、勾配の難読化または勾配マスキングをもたらさないことを実証します。

In this work we revisit gradient regularization for adversarial robustness with some new ingredients. First, we derive new per-image theoretical robustness bounds based on local gradient information. These bounds strongly motivate input gradient regularization. Second, we implement a scaleable version of input gradient regularization which avoids double backpropagation: adversarially robust ImageNet models are trained in 33 hours on four consumer grade GPUs. Finally, we show experimentally and through theoretical certification that input gradient regularization is competitive with adversarial training. Moreover we demonstrate that gradient regularization does not lead to gradient obfuscation or gradient masking.

updated: Fri Oct 04 2019 14:12:34 GMT+0000 (UTC)

published: Mon May 27 2019 19:40:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト