Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness

Jia-Li Yin; Lehui Xie; Wanqing Zhu; Ximeng Liu; Bo-Hao Chen

より良い決定のためのより厳密なプッシュ：敵対的なロバスト性を改善するためのクラス条件付き機能適応フレームワーク

敵対的な例の脅威に対応して、敵対的なトレーニングは、オンラインで拡張された敵対的な例でモデルをトレーニングすることにより、モデルの堅牢性を強化するための魅力的なオプションを提供します。ただし、既存の敵対的訓練方法のほとんどは、敵対的例を強化することによってロバストな精度を向上させることに焦点を当てていますが、自然データと敵対的例の間のシフトの増加を無視し、自然な精度の劇的な低下につながります。自然精度とロバスト精度の間のトレードオフを維持するために、機能適応の観点からのシフトを軽減し、自然データと敵対的な例全体でクラス条件付き機能適応を最適化する機能適応敵対トレーニング（FAAT）を提案します。具体的には、クラス条件付き弁別器を組み込んで、機能が（1）クラス弁別的であり、（2）敵対的攻撃の変化に対して不変になるようにすることを提案します。新しいFAATフレームワークは、自然データと敵対データにわたって同様の分布を持つ特徴を生成することにより、自然精度とロバスト精度の間のトレードオフを可能にし、クラス識別機能特性の恩恵を受けて、より高い全体的なロバスト性を実現します。さまざまなデータセットでの実験は、FAATがより識別力のある機能を生成し、最先端の方法に対して有利に機能することを示しています。コードはhttps://github.com/VisionFlow/FAATで入手できます。

In response to the threat of adversarial examples, adversarial training provides an attractive option for enhancing the model robustness by training models on online-augmented adversarial examples. However, most of the existing adversarial training methods focus on improving the robust accuracy by strengthening the adversarial examples but neglecting the increasing shift between natural data and adversarial examples, leading to a dramatic decrease in natural accuracy. To maintain the trade-off between natural and robust accuracy, we alleviate the shift from the perspective of feature adaption and propose a Feature Adaptive Adversarial Training (FAAT) optimizing the class-conditional feature adaption across natural data and adversarial examples. Specifically, we propose to incorporate a class-conditional discriminator to encourage the features become (1) class-discriminative and (2) invariant to the change of adversarial attacks. The novel FAAT framework enables the trade-off between natural and robust accuracy by generating features with similar distribution across natural and adversarial data, and achieve higher overall robustness benefited from the class-discriminative feature characteristics. Experiments on various datasets demonstrate that FAAT produces more discriminative features and performs favorably against state-of-the-art methods. Codes are available at https://github.com/VisionFlow/FAAT.

updated: Wed Dec 01 2021 07:37:56 GMT+0000 (UTC)

published: Wed Dec 01 2021 07:37:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト