Sahar Abdelnabi; Mario Fritz

「箱の中身は？！」：敵対的に互いに素なモデルをランダムに展開することで敵対的攻撃をそらす

"What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models

機械学習モデルは現在、実際のアプリケーションに広く導入されています。しかし、敵対的な例の存在は、そのようなモデルに対する本当の脅威と長い間考えられてきました。堅牢性の向上を目的とした多くの防御策が提案されていますが、多くは効果がないことが示されています。これらの脆弱性はまだ解消されていないため、従来のホワイトボックスおよびブラックボックスの脅威モデルを超える、代替の展開ベースの防御パラダイムを提案します。単一の部分的にロバストなモデルをトレーニングする代わりに、同じ機能のセットをトレーニングすることができますが、中間の攻撃の転送可能性を最小限に抑えて、敵対的に互いに素なモデルをトレーニングできます。これらのモデルは、ランダムかつ個別に展開できるため、モデルの1つにアクセスしても他のモデルへの影響は最小限に抑えられます。 CIFAR-10とさまざまな攻撃に関する実験では、アンサンブルの多様性のベースラインと比較して、互いに素なモデル全体で大幅に低い攻撃転送可能性を達成していることが示されています。さらに、敵対的に訓練されたセットと比較して、クリーンな例の精度を維持しながら、より高い平均ロバスト精度を達成します。

Machine learning models are now widely deployed in real-world applications. However, the existence of adversarial examples has been long considered a real threat to such models. While numerous defenses aiming to improve the robustness have been proposed, many have been shown ineffective. As these vulnerabilities are still nowhere near being eliminated, we propose an alternative deployment-based defense paradigm that goes beyond the traditional white-box and black-box threat models. Instead of training a single partially-robust model, one could train a set of same-functionality, yet, adversarially-disjoint models with minimal in-between attack transferability. These models could then be randomly and individually deployed, such that accessing one of them minimally affects the others. Our experiments on CIFAR-10 and a wide range of attacks show that we achieve a significantly lower attack transferability across our disjoint models compared to a baseline of ensemble diversity. In addition, compared to an adversarially trained set, we achieve a higher average robust accuracy while maintaining the accuracy of clean examples.

updated: Tue Mar 09 2021 13:53:52 GMT+0000 (UTC)

published: Tue Feb 09 2021 20:07:13 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト