On Fragile Features and Batch Normalization in Adversarial Training

Nils Philipp Walter; David Stutz; Bernt Schiele

敵対的訓練における脆弱な特徴とバッチ正規化について

最新の深層学習アーキテクチャは、バッチ正規化（BN）を利用して、トレーニングを安定させ、精度を向上させます。 BN層だけでも驚くほど表現力があることが示されています。ただし、敵対的な例に対する堅牢性のコンテキストでは、BNは脆弱性を高めると主張されています。つまり、BNは壊れやすい機能を学習するのに役立ちます。それにもかかわらず、BNは依然として敵対訓練で使用されています。これは、堅牢な機能を学習するための事実上の標準です。敵対的訓練におけるBNの役割を明らかにするために、ランダムな特徴と比較して、壊れやすい特徴を堅牢にするためにBNの表現力をどの程度使用できるかを調査します。 CIFAR10では、BNレイヤーだけを敵対的に微調整すると、敵対的な堅牢性が重要になる可能性があることがわかりました。対照的に、BNレイヤーのみを最初から敵対的に訓練することは、意味のある敵対的な頑健性を伝えることができません。私たちの結果は、脆弱な特徴を使用して中程度の敵対的ロバスト性を備えたモデルを学習できるが、ランダムな特徴は使用できないことを示しています

Modern deep learning architecture utilize batch normalization (BN) to stabilize training and improve accuracy. It has been shown that the BN layers alone are surprisingly expressive. In the context of robustness against adversarial examples, however, BN is argued to increase vulnerability. That is, BN helps to learn fragile features. Nevertheless, BN is still used in adversarial training, which is the de-facto standard to learn robust features. In order to shed light on the role of BN in adversarial training, we investigate to what extent the expressiveness of BN can be used to robustify fragile features in comparison to random features. On CIFAR10, we find that adversarially fine-tuning just the BN layers can result in non-trivial adversarial robustness. Adversarially training only the BN layers from scratch, in contrast, is not able to convey meaningful adversarial robustness. Our results indicate that fragile features can be used to learn models with moderate adversarial robustness, while random features cannot

updated: Tue Apr 26 2022 15:49:33 GMT+0000 (UTC)

published: Tue Apr 26 2022 15:49:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト