Towards Understanding the Generative Capability of Adversarially Robust Classifiers

Yao Zhu; Jiacheng Ma; Jiacheng Sun; Zewei Chen; Rongxin Jiang; Zhenguo Li

Several recent works have observed that adversarially robust classifiers can generate images comparable in quality to those produced by generative models. We investigate this phenomenon from an energy perspective and provide a novel explanation. We reformulate adversarial example generation, adversarial training, and image generation in terms of an energy function. We find that adversarial training helps produce an energy function that is flat and low around real data, which is the key to generative capability. Based on this understanding, we further propose an improved adversarial training method, Joint Energy Adversarial Training (JEAT), which can generate high-quality images and achieves new state-of-the-art robustness under a wide range of attacks. On CIFAR-10, images generated by JEAT reach an Inception Score of 8.80, well above the 7.50 of the original robust classifiers.
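The abstract does not spell out its energy definitions. As a rough illustration only, the sketch below assumes the convention common in joint energy-based models, where a classifier's logits f(x) define E(x, y) = -f(x)[y] and E(x) = -log sum_y exp(f(x)[y]). Under that assumption the cross-entropy loss equals E(x, y) - E(x), so an attack that raises the loss widens this energy gap, while training on such examples pushes it back down. The toy linear classifier, its parameters, and all function names are hypothetical and are not taken from the paper.

```python
import math

def logits(W, b, x):
    """Toy linear classifier: one logit per class, standing in for f(x)."""
    return [sum(w_i * x_i for w_i, x_i in zip(row, x)) + b_k
            for row, b_k in zip(W, b)]

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def joint_energy(z, y):
    """E(x, y) = -f(x)[y] (assumed definition, not given in the abstract)."""
    return -z[y]

def marginal_energy(z):
    """E(x) = -log sum_y exp(f(x)[y]) (assumed definition)."""
    return -logsumexp(z)

def cross_entropy(z, y):
    """Standard softmax cross-entropy for true label y."""
    return -(z[y] - logsumexp(z))

# Hypothetical toy setup: 3 classes, 2 input features.
W = [[1.0, -0.5], [0.2, 0.8], [-1.0, 0.3]]
b = [0.0, 0.1, -0.2]
x = [0.5, -1.0]
y = 1

z = logits(W, b, x)
ce = cross_entropy(z, y)
gap = joint_energy(z, y) - marginal_energy(z)

# Under the assumed definitions the two quantities coincide.
print(abs(ce - gap) < 1e-12)
```

Seen this way, the identity only relates the classification loss to an energy difference; it says nothing by itself about the flatness of E(x) around real data, which is the property the abstract attributes to adversarial training.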

updated: Tue Sep 14 2021 11:06:28 GMT+0000 (UTC)

published: Fri Aug 20 2021 10:13:15 GMT+0000 (UTC)

arXiv
