Robustness and Generalization via Generative Adversarial Training

Omid Poursaeed; Tianxing Jiang; Harry Yang; Serge Belongie; SerNam Lim

生成的敵対的トレーニングによる堅牢性と一般化

ディープニューラルネットワークは、さまざまなコンピュータビジョンタスクで目覚ましい成功を収めていますが、新しいドメインや入力画像の微妙なバリエーションに一般化できないことがよくあります。これらの変動に対するロバスト性を改善するために、いくつかの防御策が提案されています。ただし、現在の防御はトレーニングで使用される特定の攻撃にしか耐えることができず、モデルは他の入力バリエーションに対して脆弱なままであることがよくあります。さらに、これらの方法は、クリーンな画像でのモデルのパフォーマンスを低下させることが多く、ドメイン外のサンプルに一般化されません。このホワイトペーパーでは、生成的敵対的トレーニングを紹介します。これは、モデルのテストセットとドメイン外サンプルへの一般化と、目に見えない敵対的攻撃に対する堅牢性を同時に改善するためのアプローチです。画像の低レベルの事前定義された側面を変更する代わりに、解きほぐされた潜在空間を持つ生成モデルを使用して、低レベル、中レベル、および高レベルの変化のスペクトルを生成します。これらの例を使用した敵対的なトレーニングにより、モデルはトレーニング中にさまざまな入力の変化を観察することで、さまざまな攻撃に耐えることができます。私たちのアプローチは、クリーンな画像やドメイン外のサンプルでのモデルのパフォーマンスを向上させるだけでなく、予期しない攻撃に対して堅牢にし、以前の作業よりも優れていることを示しています。分類、セグメンテーション、オブジェクト検出などのさまざまなタスクの結果を示すことにより、メソッドの有効性を検証します。

While deep neural networks have achieved remarkable success in various computer vision tasks, they often fail to generalize to new domains and subtle variations of input images. Several defenses have been proposed to improve the robustness against these variations. However, current defenses can only withstand the specific attack used in training, and the models often remain vulnerable to other input variations. Moreover, these methods often degrade performance of the model on clean images and do not generalize to out-of-domain samples. In this paper we present Generative Adversarial Training, an approach to simultaneously improve the model's generalization to the test set and out-of-domain samples as well as its robustness to unseen adversarial attacks. Instead of altering a low-level pre-defined aspect of images, we generate a spectrum of low-level, mid-level and high-level changes using generative models with a disentangled latent space. Adversarial training with these examples enable the model to withstand a wide range of attacks by observing a variety of input alterations during training. We show that our approach not only improves performance of the model on clean images and out-of-domain samples but also makes it robust against unforeseen attacks and outperforms prior work. We validate effectiveness of our method by demonstrating results on various tasks such as classification, segmentation and object detection.

updated: Mon Sep 06 2021 22:34:04 GMT+0000 (UTC)

published: Mon Sep 06 2021 22:34:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト