Improving Transferability of Adversarial Patches on Face Recognition with Generative Models

Zihao Xiao; Xianfeng Gao; Chilin Fu; Yinpeng Dong; Wei Gao; Xiaolu Zhang; Jun Zhou; Jun Zhu

生成モデルによる顔認識での敵対パッチの転送可能性の改善

顔認識は、深い畳み込みニューラルネットワーク（CNN）によって大幅に改善されます。最近、これらの顔認識モデルは、セキュリティに敏感なアプリケーションのID認証に使用されています。ただし、ディープCNNは、物理的に実現可能でステルスな敵対的なパッチに対して脆弱であり、これらのモデルの実際のアプリケーションに新たなセキュリティ上の懸念を引き起こします。この論文では、攻撃者がターゲットモデルへのアクセスを制限している場合に、転送可能性に基づいて敵対パッチを使用して顔認識モデルの堅牢性を評価します。まず、既存の転送ベースの攻撃手法を拡張して、転送可能な敵対パッチを生成します。ただし、転送可能性は初期化に敏感であり、摂動の大きさが大きいと低下することがわかります。これは、代替モデルへの過剰適合を示しています。次に、低次元のデータ多様体上の敵対的なパッチを正規化することを提案します。多様体は、正当な人間の顔の画像で事前にトレーニングされた生成モデルによって表されます。マニフォールドの最適化による敵対的摂動として顔のような特徴を使用して、代替モデルとターゲットモデルの応答間のギャップが劇的に減少し、より良い転送可能性を示すことを示します。ブラックボックス設定における提案された方法の優位性を実証するために、広範なデジタル世界実験が実施されます。提案手法を実世界にも適用します。

Face recognition is greatly improved by deep convolutional neural networks (CNNs). Recently, these face recognition models have been used for identity authentication in security sensitive applications. However, deep CNNs are vulnerable to adversarial patches, which are physically realizable and stealthy, raising new security concerns on the real-world applications of these models. In this paper, we evaluate the robustness of face recognition models using adversarial patches based on transferability, where the attacker has limited accessibility to the target models. First, we extend the existing transfer-based attack techniques to generate transferable adversarial patches. However, we observe that the transferability is sensitive to initialization and degrades when the perturbation magnitude is large, indicating the overfitting to the substitute models. Second, we propose to regularize the adversarial patches on the low dimensional data manifold. The manifold is represented by generative models pre-trained on legitimate human face images. Using face-like features as adversarial perturbations through optimization on the manifold, we show that the gaps between the responses of substitute models and the target models dramatically decrease, exhibiting a better transferability. Extensive digital world experiments are conducted to demonstrate the superiority of the proposed method in the black-box setting. We apply the proposed method in the physical world as well.

updated: Mon Aug 30 2021 03:59:50 GMT+0000 (UTC)

published: Tue Jun 29 2021 02:13:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト