FRT-PAD: Effective Presentation Attack Detection Driven by Face Related Task

Wentian Zhang; Haozhe Liu; Feng Liu; Raghavendra Ramachandra; Christoph Busch

FRT-PAD: 顔関連タスクによる効果的なプレゼンテーション攻撃検出

顔認識システム (FRS) のセキュリティを確保するには、プレゼンテーション攻撃検出 (PAD) メソッドの堅牢性と一般化機能が不可欠です。ただし、実際のシナリオでは、プレゼンテーション攻撃 (PA) はさまざまであり、攻撃者が使用するプレゼンテーション攻撃手段 (PAI) の種類を予測することは困難です。既存の PAD メソッドは、限られたトレーニングセットに大きく依存しており、未知の PAI 種にうまく一般化できません。この特定の PAD タスクとは異なり、膨大な量の実際の顔によって訓練された他の顔関連のタスク (顔認識や属性編集など) は、さまざまなアプリケーションシナリオに効果的に採用できます。これに着想を得て、PAD の位置と顔関連の作業を顔システムで交換し、顔関連のタスクから自由に獲得した事前知識を顔 PAD の解決に適用して、PA の検出における一般化能力を向上させることを提案します。提案された方法は、最初に他の顔関連タスクからタスク固有の機能を導入し、次にGraph Attention Network（GAT）を使用してクロスモーダルアダプターを設計し、そのような機能を再マッピングしてPADタスクに適応させます。最後に、顔 PAD は、CNN ベースの PA 検出器からの階層的特徴と再マッピングされた特徴を使用して実現されます。実験結果は、最先端の方法と比較した場合、提案された方法が複雑でハイブリッドなデータセットで大幅な改善を達成できることを示しています。特に、データセット OULU-NPU、CASIA-FASD、および Idiap Replay-Attack でトレーニングすると、テストデータセット MSU-MFSD で 5.48% の HTER (Half Total Error Rate) が得られ、ベースラインを 7.39% 上回っています。

The robustness and generalization ability of Presentation Attack Detection (PAD) methods is critical to ensure the security of Face Recognition Systems (FRSs). However, in a real scenario, Presentation Attacks (PAs) are various and it is hard to predict the Presentation Attack Instrument (PAI) species that will be used by the attacker. Existing PAD methods are highly dependent on the limited training set and cannot generalize well to unknown PAI species. Unlike this specific PAD task, other face related tasks trained by huge amount of real faces (e.g. face recognition and attribute editing) can be effectively adopted into different application scenarios. Inspired by this, we propose to trade position of PAD and face related work in a face system and apply the free acquired prior knowledge from face related tasks to solve face PAD, so as to improve the generalization ability in detecting PAs. The proposed method, first introduces task specific features from other face related task, then, we design a Cross-Modal Adapter using a Graph Attention Network (GAT) to re-map such features to adapt to PAD task. Finally, face PAD is achieved by using the hierarchical features from a CNN-based PA detector and the re-mapped features. The experimental results show that the proposed method can achieve significant improvements in the complicated and hybrid datasets, when compared with the state-of-the-art methods. In particular, when training on the datasets OULU-NPU, CASIA-FASD, and Idiap Replay-Attack, we obtain HTER (Half Total Error Rate) of 5.48% for the testing dataset MSU-MFSD, outperforming the baseline by 7.39%.

updated: Sun Jul 31 2022 10:34:30 GMT+0000 (UTC)

published: Mon Nov 22 2021 08:35:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト