Improved robustness to adversarial examples using Lipschitz   regularization of the loss

Chris Finlay; Adam Oberman; Bilal Abbasi

損失のリプシッツ正則化を使用した敵対的な例に対する堅牢性の向上

Improved robustness to adversarial examples using Lipschitz regularization of the loss

敵対者訓練（AT）を最悪の敵対者訓練（WCAT）で補強し、CIFAR-10のℓ_2ノルムにおける現在の最先端の結果よりも敵対者の堅牢性を11％向上させます。損失の勾配のノルムの期待値と最大値に基づいて、検証可能な平均ケースと最悪ケースの堅牢性保証を取得します。敵対的トレーニングは、数学的画像処理の基本ツールであるTotal Variation Regularizationとして、WCATはLipschitz正則化として解釈します。

We augment adversarial training (AT) with worst case adversarial training (WCAT) which improves adversarial robustness by 11% over the current state-of-the-art result in the ℓ_2 norm on CIFAR-10. We obtain verifiable average case and worst case robustness guarantees, based on the expected and maximum values of the norm of the gradient of the loss. We interpret adversarial training as Total Variation Regularization, which is a fundamental tool in mathematical image processing, and WCAT as Lipschitz regularization.

updated: Fri Sep 13 2019 14:56:57 GMT+0000 (UTC)

published: Mon Oct 01 2018 20:02:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト