Group-based Robustness: A General Framework for Customized Robustness in the Real World

Weiran Lin; Keane Lucas; Neo Eyal; Lujo Bauer; Michael K. Reiter; Mahmood Sharif

グループベースの堅牢性: 現実世界におけるカスタマイズされた堅牢性のための一般的なフレームワーク

機械学習モデルは、モデル入力を混乱させて誤分類を誘発する回避攻撃に対して脆弱であることが知られています。この研究では、既存の攻撃では真の脅威を正確に評価できない現実のシナリオを特定します。具体的には、ターゲットとターゲット外の堅牢性を測定する従来の指標は、あるソースクラスのセットから別のターゲットクラスのセットへの攻撃に耐えるモデルの能力を適切に反映していないことがわかりました。既存の手法の欠点に対処するために、グループベースの堅牢性と呼ばれる新しい指標を正式に定義します。これは、既存の指標を補完し、特定の攻撃シナリオでのモデルのパフォーマンスを評価するのにより適しています。私たちは、グループベースの堅牢性により、従来の堅牢性の指標が適用されない状況において、特定の脅威モデルに対するモデルの脆弱性を区別できることを経験的に示しています。さらに、グループベースの堅牢性を効率的かつ正確に測定するために、1) 2 つの損失関数を提案し、2) 3 つの新しい攻撃戦略を特定します。私たちは、同等の成功率で、新しい損失関数を使用して回避サンプルを見つけると、ターゲットとなるクラスの数と同じくらい大きな計算量を節約し、新しい攻撃戦略を使用して回避サンプルを見つけると、ブルートと比較して時間を最大 99% 節約できることを経験的に示しています。 -検索メソッドを強制します。最後に、グループベースの堅牢性を最大 3.52 倍向上させる防御方法を提案します。

Machine-learning models are known to be vulnerable to evasion attacks that perturb model inputs to induce misclassifications. In this work, we identify real-world scenarios where the true threat cannot be assessed accurately by existing attacks. Specifically, we find that conventional metrics measuring targeted and untargeted robustness do not appropriately reflect a model's ability to withstand attacks from one set of source classes to another set of target classes. To address the shortcomings of existing methods, we formally define a new metric, termed group-based robustness, that complements existing metrics and is better-suited for evaluating model performance in certain attack scenarios. We show empirically that group-based robustness allows us to distinguish between models' vulnerability against specific threat models in situations where traditional robustness metrics do not apply. Moreover, to measure group-based robustness efficiently and accurately, we 1) propose two loss functions and 2) identify three new attack strategies. We show empirically that with comparable success rates, finding evasive samples using our new loss functions saves computation by a factor as large as the number of targeted classes, and finding evasive samples using our new attack strategies saves time by up to 99% compared to brute-force search methods. Finally, we propose a defense method that increases group-based robustness by up to 3.52×.

updated: Mon Sep 04 2023 14:54:17 GMT+0000 (UTC)

published: Thu Jun 29 2023 01:07:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト