LTD: Low Temperature Distillation for Robust Adversarial Training

Erh-Chung Chen; Che-Rung Lee

LTD: 強力な敵対的トレーニングのための低温蒸留

敵対的トレーニングは、敵対的攻撃に対するニューラルネットワークモデルの堅牢性を強化するために広く使用されています。ニューラルネットワークモデルの人気にもかかわらず、これらのモデルの自然な精度と堅牢な精度の間には大きなギャップが存在します。この論文では、このギャップの主な理由の 1 つが、画像認識の学習プロセスを妨げるワンホットベクトルのラベルとしての一般的な使用であることを特定します。あいまいなイメージをワンホットベクトルで表現することは不正確であり、モデルが次善の解決策につながる可能性があります。この問題を克服するために、修正された知識蒸留フレームワークを使用してソフトラベルを生成する低温蒸留 (LTD) と呼ばれる新しい方法を提案します。以前のアプローチとは異なり、LTD では教師モデルでは比較的低い温度を使用し、教師モデルと生徒モデルでは固定ではあるが異なる温度を使用します。この変更により、防御的蒸留で対処されてきた勾配マスキングの問題に遭遇することなく、モデルの堅牢性が向上します。実験結果は、提案された LTD 手法と以前の手法を組み合わせた有効性を示しており、追加のラベルなしデータなしで、CIFAR-10、CIFAR-100、および ImageNet データセットに対してそれぞれ 58.19%、31.13%、および 42.08% の堅牢な精度率を達成しています。。

Adversarial training has been widely used to enhance the robustness of neural network models against adversarial attacks. Despite the popularity of neural network models, a significant gap exists between the natural and robust accuracy of these models. In this paper, we identify one of the primary reasons for this gap is the common use of one-hot vectors as labels, which hinders the learning process for image recognition. Representing ambiguous images with one-hot vectors is imprecise and may lead the model to suboptimal solutions. To overcome this issue, we propose a novel method called Low Temperature Distillation (LTD) that generates soft labels using the modified knowledge distillation framework. Unlike previous approaches, LTD uses a relatively low temperature in the teacher model and fixed, but different temperatures for the teacher and student models. This modification boosts the model's robustness without encountering the gradient masking problem that has been addressed in defensive distillation. The experimental results demonstrate the effectiveness of the proposed LTD method combined with previous techniques, achieving robust accuracy rates of 58.19%, 31.13%, and 42.08% on CIFAR-10, CIFAR-100, and ImageNet data sets, respectively, without additional unlabeled data.

updated: Fri Jun 30 2023 06:56:18 GMT+0000 (UTC)

published: Wed Nov 03 2021 16:26:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト