Can we achieve robustness from data alone?

Nikolaos Tsilivis; Jingtong Su; Julia Kempe

データだけで堅牢性を実現できますか？

敵対的訓練とその変種は、ニューラルネットワークを使用して敵対的にロバストな分類を達成するための一般的な方法になりました。ただし、計算コストの増加と、標準パフォーマンスと堅牢なパフォーマンスの間の大きなギャップが進行を妨げ、より良い結果が得られるかどうかという疑問を投げかけています。この作業では、一歩下がって質問します。モデルは、適切に最適化されたセットでの標準トレーニングを介して堅牢性を実現できますか？この目的のために、ロバストな分類のためのメタ学習方法を考案します。これは、原則的な方法で展開する前にデータセットを最適化し、データのロバストでない部分を効果的に削除することを目的としています。最適化手法をカーネル回帰のマルチステップPGDプロシージャとしてキャストし、無限に広いニューラルネット（ニューラルタンジェントカーネル-NTK）を記述するカーネルのクラスを使用します。 MNISTとCIFAR-10での実験は、カーネル回帰分類器とニューラルネットワークの両方にデプロイされた場合、私たちが生成するデータセットがPGD攻撃に対して非常に高い堅牢性を享受することを示しています。ただし、代替攻撃がモデルをだますことができるため、この堅牢性はやや誤りです。これは、以前の同様の文献の研究にも当てはまります。これの潜在的な理由について説明し、研究のさらなる道を概説します。

Adversarial training and its variants have come to be the prevailing methods to achieve adversarially robust classification using neural networks. However, its increased computational cost together with the significant gap between standard and robust performance hinder progress and beg the question of whether we can do better. In this work, we take a step back and ask: Can models achieve robustness via standard training on a suitably optimized set? To this end, we devise a meta-learning method for robust classification, that optimizes the dataset prior to its deployment in a principled way, and aims to effectively remove the non-robust parts of the data. We cast our optimization method as a multi-step PGD procedure on kernel regression, with a class of kernels that describe infinitely wide neural nets (Neural Tangent Kernels - NTKs). Experiments on MNIST and CIFAR-10 demonstrate that the datasets we produce enjoy very high robustness against PGD attacks, when deployed in both kernel regression classifiers and neural networks. However, this robustness is somewhat fallacious, as alternative attacks manage to fool the models, which we find to be the case for previous similar works in the literature as well. We discuss potential reasons for this and outline further avenues of research.

updated: Sun Jul 24 2022 12:14:48 GMT+0000 (UTC)

published: Sun Jul 24 2022 12:14:48 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト