Considerations on the Evaluation of Biometric Quality Assessment Algorithms

Torsten Schlett; Christian Rathgeb; Juan Tapia; Christoph Busch

生体認証品質評価アルゴリズムの評価に関する考慮事項

品質評価アルゴリズムを使用して、バイオメトリック認識のためのバイオメトリックサンプルの有用性を推定できます。 "Error vs Discard Characteristic" (EDC) プロット、およびその中の曲線の "partial Area Under Curve" (pAUC) 値は、一般に研究者がこのような品質評価アルゴリズムの予測性能を評価するために使用します。 EDC 曲線は、「False Non Match Rate」(FNMR)、品質評価アルゴリズム、生体認証システム、生体認証サンプルペアにそれぞれ対応する一連の比較、および対応する比較スコアしきい値などのエラータイプに依存します。起動エラー。 EDC 曲線を計算するために、関連するサンプルの最低の品質スコアに基づいて比較が徐々に破棄され、残りの比較の誤差が計算されます。さらに、pAUC 値を計算するには、破棄割合の制限または範囲を選択する必要があります。これを使用して、品質評価アルゴリズムを定量的にランク付けできます。このホワイトペーパーでは、この種の品質評価アルゴリズムの評価に関するさまざまな詳細について説明し、分析します。これには、一般的な EDC プロパティ、ハード下限エラー限界とソフト上限エラー限界に基づく pAUC 値の解釈可能性の改善、個別のランキングではなく相対的なランク付けの使用、段階的な評価が含まれます。対線形曲線補間、および [0, 100] 整数範囲への品質スコアの正規化。また、さまざまな pAUC 廃棄率制限と開始エラーにわたる pAUC 値に基づいて、定量的品質評価アルゴリズムランキングの安定性を分析し、より高い pAUC 廃棄率制限が優先されるべきであると結論付けました。分析は、合成データと顔画像品質評価シナリオの実際のデータの両方で行われ、EDC 評価の一般的なモダリティに依存しない結論に焦点を当てています。

Quality assessment algorithms can be used to estimate the utility of a biometric sample for the purpose of biometric recognition. "Error versus Discard Characteristic" (EDC) plots, and "partial Area Under Curve" (pAUC) values of curves therein, are generally used by researchers to evaluate the predictive performance of such quality assessment algorithms. An EDC curve depends on an error type such as the "False Non Match Rate" (FNMR), a quality assessment algorithm, a biometric recognition system, a set of comparisons each corresponding to a biometric sample pair, and a comparison score threshold corresponding to a starting error. To compute an EDC curve, comparisons are progressively discarded based on the associated samples' lowest quality scores, and the error is computed for the remaining comparisons. Additionally, a discard fraction limit or range must be selected to compute pAUC values, which can then be used to quantitatively rank quality assessment algorithms. This paper discusses and analyses various details for this kind of quality assessment algorithm evaluation, including general EDC properties, interpretability improvements for pAUC values based on a hard lower error limit and a soft upper error limit, the use of relative instead of discrete rankings, stepwise vs. linear curve interpolation, and normalisation of quality scores to a [0, 100] integer range. We also analyse the stability of quantitative quality assessment algorithm rankings based on pAUC values across varying pAUC discard fraction limits and starting errors, concluding that higher pAUC discard fraction limits should be preferred. The analyses are conducted both with synthetic data and with real data for a face image quality assessment scenario, with a focus on general modality-independent conclusions for EDC evaluations.

updated: Thu Mar 23 2023 14:26:21 GMT+0000 (UTC)

published: Thu Mar 23 2023 14:26:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト