FaceGuard: A Self-Supervised Defense Against Adversarial Face Images

Debayan Deb; Xiaoming Liu; Anil K. Jain

Prevailing defense mechanisms against adversarial face images tend to overfit to the adversarial perturbations in the training set and fail to generalize to unseen adversarial attacks. We propose a new self-supervised adversarial defense framework, FaceGuard, that can automatically detect, localize, and purify a wide variety of adversarial faces without using pre-computed adversarial training samples. During training, FaceGuard automatically synthesizes challenging and diverse adversarial attacks, which lets a classifier learn to distinguish them from real faces while a purifier attempts to remove the adversarial perturbations in the image space. Experimental results on the LFW dataset show that FaceGuard achieves 99.81% detection accuracy on six unseen adversarial attack types. In addition, the proposed method raises the face recognition performance of ArcFace from 34.27% TAR @ 0.1% FAR under no defense to 77.46% TAR @ 0.1% FAR.
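
The recognition figures above are reported as TAR @ 0.1% FAR: the True Accept Rate on genuine pairs at a similarity threshold chosen so that the False Accept Rate on impostor pairs is at most 0.1%. The following is a minimal sketch of how such a number can be computed from similarity scores. The function name `tar_at_far`, its parameters, and the tie-handling rule are illustrative assumptions, not taken from the paper, whose exact evaluation protocol may differ.

```python
import numpy as np

def tar_at_far(genuine_scores, impostor_scores, far=0.001):
    """Illustrative helper (not from the paper): TAR at a threshold whose FAR is at most `far`.

    Scores are similarities, so a pair is accepted when its score is
    strictly greater than the threshold.
    """
    genuine = np.asarray(genuine_scores, dtype=float)
    # Sort impostor scores from highest to lowest.
    impostor = np.sort(np.asarray(impostor_scores, dtype=float))[::-1]

    # Number of impostor pairs we may accept without exceeding the target FAR.
    allowed = int(np.floor(far * impostor.size))

    # Accepting only scores strictly above the (allowed + 1)-th highest
    # impostor score admits at most `allowed` impostor pairs.
    threshold = impostor[allowed] if allowed < impostor.size else -np.inf

    tar = float(np.mean(genuine > threshold))
    actual_far = float(np.mean(impostor > threshold))
    return tar, actual_far, threshold
```

With only a handful of impostor pairs, a 0.1% FAR target rounds down to zero allowed false accepts, so the threshold becomes the highest impostor score; a meaningful estimate at that operating point needs at least a thousand impostor pairs.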

updated: Mon Apr 05 2021 20:37:56 GMT+0000 (UTC)

published: Sat Nov 28 2020 21:18:46 GMT+0000 (UTC)

arXiv
