Improving Model Generalization by On-manifold Adversarial Augmentation in the Frequency Domain

Chang Liu; Wenzhao Xiang; Yuan He; Hui Xue; Shibao Zheng; Hang Su

周波数領域での多様体上の敵対的増強によるモデルの一般化の改善

ディープニューラルネットワーク (DNN) は、トレーニングデータとテストデータの基になる分布が異なる場合、パフォーマンスが大幅に低下する可能性があります。分布外 (OOD) データに対するモデルの一般化の重要性にもかかわらず、OOD データに対する最先端 (SOTA) モデルの精度は急落する可能性があります。最近の研究では、データ拡張の特殊なケースとして、通常または非多様な敵対的例を使用して、OOD の一般化を改善できることが実証されています。これに触発されて、多様体上の敵対的な例がOODの一般化に役立つことを理論的に証明します。それにもかかわらず、実際の多様体は一般に複雑であるため、多様体上の敵対的な例を生成することは自明ではありません。この問題に対処するために、Wavelet モジュール (AdvWavAug) を介して敵対的な例でデータを増強する新しい方法を提案しました。これは、実装が簡単な多様体上の敵対的データ増強手法です。特に、無害な画像をウェーブレットドメインに射影します。ウェーブレット変換のスパース特性の助けを借りて、推定されたデータ多様体で画像を変更できます。 AdvPropトレーニングフレームワークに基づいて敵対的拡張を行います。 ImageNet とその歪んだバージョンを含む、さまざまなモデルとさまざまなデータセットでの広範な実験は、特に OOD データで、モデルの一般化を改善できることを示しています。 AdvWavAug をトレーニングプロセスに統合することで、最近のトランスフォーマーベースのモデルで SOTA の結果を達成しました。

Deep neural networks (DNNs) may suffer from significantly degenerated performance when the training and test data are of different underlying distributions. Despite the importance of model generalization to out-of-distribution (OOD) data, the accuracy of state-of-the-art (SOTA) models on OOD data can plummet. Recent work has demonstrated that regular or off-manifold adversarial examples, as a special case of data augmentation, can be used to improve OOD generalization. Inspired by this, we theoretically prove that on-manifold adversarial examples can better benefit OOD generalization. Nevertheless, it is nontrivial to generate on-manifold adversarial examples because the real manifold is generally complex. To address this issue, we proposed a novel method of Augmenting data with Adversarial examples via a Wavelet module (AdvWavAug), an on-manifold adversarial data augmentation technique that is simple to implement. In particular, we project a benign image into a wavelet domain. With the assistance of the sparsity characteristic of wavelet transformation, we can modify an image on the estimated data manifold. We conduct adversarial augmentation based on AdvProp training framework. Extensive experiments on different models and different datasets, including ImageNet and its distorted versions, demonstrate that our method can improve model generalization, especially on OOD data. By integrating AdvWavAug into the training process, we have achieved SOTA results on some recent transformer-based models.

updated: Sun Jun 09 2024 03:32:58 GMT+0000 (UTC)

published: Tue Feb 28 2023 04:31:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト