Learning the Unlearnable: Adversarial Augmentations Suppress Unlearnable Example Attacks

Tianrui Qin; Xitong Gao; Juanjuan Zhao; Kejiang Ye; Cheng-Zhong Xu

学習不可能なものを学習する: 敵対的増強は学習不可能な攻撃の例を抑制します

学習不能な攻撃の例は、深層学習モデルのトレーニングのために公開データを不正使用から保護するために使用できるデータポイズニング手法です。これらの方法は、元の画像にステルスな摂動を追加するため、ディープラーニングモデルがこれらのトレーニングデータから効果的に学習することが困難になります。現在の調査によると、敵対的トレーニングは、学習不可能な例の攻撃の影響をある程度軽減できますが、一般的なデータ拡張方法はそのような毒に対して効果的ではありません.ただし、敵対的トレーニングはかなりの計算リソースを必要とし、重要な精度の損失をもたらす可能性があります。このホワイトペーパーでは、効果的なデータ拡張ポリシーと損失を最大化する敵対的拡張の組み合わせにより、さまざまなタイプの最先端の学習不可能な攻撃の例に対する現在の防御よりも優れた UEraser メソッドを紹介します。現在の SOTA 敵対的トレーニング方法とはまったく対照的に、UEraser は敵対的増強を使用します。これは、現在の非学習攻撃と防御によって想定される ℓ_p 摂動バジェットの範囲を超えて拡張されます。また、モデルの一般化能力を向上させ、精度の低下を防ぎます。 UEraser は、エラーを最大化するデータ拡張によって未学習の影響を一掃し、トレーニングされたモデルの精度を復元します。興味深いことに、敵対的な拡張を行わない高速な亜種である UEraser-Lite も、クリーンな精度を維持するのに非常に効果的です。さまざまな攻撃で生成された、学習不可能な CIFAR-10、CIFAR-100、SVHN、および ImageNet サブセットデータセットに挑戦することで、クリーントレーニング中に得られた結果に匹敵する結果を達成します。また、適応型攻撃の可能性に対するその有効性も示しています。私たちのコードはオープンソースであり、深層学習コミュニティ (https://github.com/lafeat/ueraser) で利用できます。

Unlearnable example attacks are data poisoning techniques that can be used to safeguard public data against unauthorized use for training deep learning models. These methods add stealthy perturbations to the original image, thereby making it difficult for deep learning models to learn from these training data effectively. Current research suggests that adversarial training can, to a certain degree, mitigate the impact of unlearnable example attacks, while common data augmentation methods are not effective against such poisons. Adversarial training, however, demands considerable computational resources and can result in non-trivial accuracy loss. In this paper, we introduce the UEraser method, which outperforms current defenses against different types of state-of-the-art unlearnable example attacks through a combination of effective data augmentation policies and loss-maximizing adversarial augmentations. In stark contrast to the current SOTA adversarial training methods, UEraser uses adversarial augmentations, which extends beyond the confines of ℓ_p perturbation budget assumed by current unlearning attacks and defenses. It also helps to improve the model's generalization ability, thus protecting against accuracy loss. UEraser wipes out the unlearning effect with error-maximizing data augmentations, thus restoring trained model accuracies. Interestingly, UEraser-Lite, a fast variant without adversarial augmentations, is also highly effective in preserving clean accuracies. On challenging unlearnable CIFAR-10, CIFAR-100, SVHN, and ImageNet-subset datasets produced with various attacks, it achieves results that are comparable to those obtained during clean training. We also demonstrate its efficacy against possible adaptive attacks. Our code is open source and available to the deep learning community: https://github.com/lafeat/ueraser.

updated: Mon Mar 27 2023 12:00:54 GMT+0000 (UTC)

published: Mon Mar 27 2023 12:00:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト