Towards A Unified Min-Max Framework for Adversarial Exploration and   Robustness

Jingkang Wang; Tianyun Zhang; Sijia Liu; Pin-Yu Chen; Jiacen Xu; Makan Fardad; Bo Li

敵の探査とロバスト性のための統一されたMin-Maxフレームワークに向けて

Towards A Unified Min-Max Framework for Adversarial Exploration and Robustness

最大の敵の損失を最小化する最悪の場合のトレーニングの原則は、敵対トレーニング（AT）としても知られ、標準ボールで制限された入力摂動に対する敵対的な堅牢性を強化するための最先端のアプローチであることが示されています。それにもかかわらず、ATの目的を超えたmin-max最適化は、敵の攻撃と防御の研究では厳密に調査されていません。特に、リスクソース（ドメイン）のセットが与えられた場合、ドメインセットから誘導される最大損失を最小化することは、ATとは異なる一般的な最小-最大問題として再定式化できます。この一般的な定式化の例には、モデルアンサンブルの攻撃、複数の入力またはデータ変換の下での普遍的な摂動の考案、さまざまなタイプの攻撃モデルに対する一般化されたATが含まれます。これらの問題は、統一された理論的に原理化されたmin-max最適化フレームワークの下で解決できることを示します。また、この方法から学習した自己調整ドメインの重みは、複数のドメインにわたる攻撃と防御の難易度を説明する手段を提供することも示しています。広範な実験により、このアプローチが従来の平均化戦略よりも大幅にパフォーマンスを向上させることが示されています。

The worst-case training principle that minimizes the maximal adversarial loss, also known as adversarial training (AT), has shown to be a state-of-the-art approach for enhancing adversarial robustness against norm-ball bounded input perturbations. Nonetheless, min-max optimization beyond the purpose of AT has not been rigorously explored in the research of adversarial attack and defense. In particular, given a set of risk sources (domains), minimizing the maximal loss induced from the domain set can be reformulated as a general min-max problem that is different from AT. Examples of this general formulation include attacking model ensembles, devising universal perturbation under multiple inputs or data transformations, and generalized AT over different types of attack models. We show that these problems can be solved under a unified and theoretically principled min-max optimization framework. We also show that the self-adjusted domain weights learned from our method provides a means to explain the difficulty level of attack and defense over multiple domains. Extensive experiments show that our approach leads to substantial performance improvement over the conventional averaging strategy.

updated: Sun Sep 29 2019 15:49:55 GMT+0000 (UTC)

published: Sun Jun 09 2019 04:32:13 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト