RobustBench: a standardized adversarial robustness benchmark

Francesco Croce; Maksym Andriushchenko; Vikash Sehwag; Edoardo Debenedetti; Nicolas Flammarion; Mung Chiang; Prateek Mittal; Matthias Hein

RobustBench：標準化された敵対的ロバストネスベンチマーク

研究コミュニティとして、私たちは敵対的なロバスト性の進歩についての体系的な理解をまだ欠いており、ロバストなモデルのトレーニングで最も有望なアイデアを特定するのが難しいことがよくあります。堅牢性のベンチマークにおける重要な課題は、その評価がエラーを起こしやすく、堅牢性の過大評価につながることです。私たちの目標は、敵対的なロバスト性の標準化されたベンチマークを確立することです。これは、妥当な計算予算内で検討対象のモデルのロバスト性を可能な限り正確に反映します。この目的のために、画像分類タスクを検討することから始め、許可されたモデルに制限（将来的に緩和される可能性があります）を導入します。ホワイトボックス攻撃とブラックボックス攻撃のアンサンブルであるAutoAttackを使用して、敵対的な堅牢性を評価します。これは、元の出版物と比較してほぼすべての堅牢性評価を改善することが最近大規模な調査で示されました。 AutoAttackに対する新しい防御の過剰適応を防ぐために、特にAutoAttackが堅牢性の潜在的な過大評価にフラグを立てる場合、適応攻撃に基づく外部評価を歓迎します。 https://robustbench.github.io/でホストされているリーダーボードには、120以上のモデルの評価が含まれており、ℓ_∞-およびℓ_2-の一連の明確に定義されたタスクの画像分類に現在の最先端技術を反映することを目的としています。脅威モデルと一般的な破損について、将来的に拡張される可能性があります。さらに、ライブラリhttps://github.com/RobustBench/robustbenchをオープンソース化して、80以上の堅牢なモデルへの統合アクセスを提供し、ダウンストリームアプリケーションを容易にします。最後に、収集されたモデルに基づいて、パフォーマンスに対する堅牢性が、分布シフト、キャリブレーション、分布外検出、公平性、プライバシー漏えい、滑らかさ、および転送可能性に与える影響を分析します。

As a research community, we are still lacking a systematic understanding of the progress on adversarial robustness which often makes it hard to identify the most promising ideas in training robust models. A key challenge in benchmarking robustness is that its evaluation is often error-prone leading to robustness overestimation. Our goal is to establish a standardized benchmark of adversarial robustness, which as accurately as possible reflects the robustness of the considered models within a reasonable computational budget. To this end, we start by considering the image classification task and introduce restrictions (possibly loosened in the future) on the allowed models. We evaluate adversarial robustness with AutoAttack, an ensemble of white- and black-box attacks, which was recently shown in a large-scale study to improve almost all robustness evaluations compared to the original publications. To prevent overadaptation of new defenses to AutoAttack, we welcome external evaluations based on adaptive attacks, especially where AutoAttack flags a potential overestimation of robustness. Our leaderboard, hosted at https://robustbench.github.io/, contains evaluations of 120+ models and aims at reflecting the current state of the art in image classification on a set of well-defined tasks in ℓ_∞- and ℓ_2-threat models and on common corruptions, with possible extensions in the future. Additionally, we open-source the library https://github.com/RobustBench/robustbench that provides unified access to 80+ robust models to facilitate their downstream applications. Finally, based on the collected models, we analyze the impact of robustness on the performance on distribution shifts, calibration, out-of-distribution detection, fairness, privacy leakage, smoothness, and transferability.

updated: Sun Oct 31 2021 20:03:39 GMT+0000 (UTC)

published: Mon Oct 19 2020 17:06:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト