Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models

Biying Fu; Naser Damer

顔認識モデルにおけるアクティベーションマップ変動分析による教師ありおよび教師なし顔画像品質推定の意味の説明可能性

教師なしまたは統計ベースの顔の画像品質評価（FIQA）手法の説明可能性を導き出すことは困難です。この作業では、さまざまなFIQA決定の理由と、それらの顔認識（FR）パフォーマンスへの影響を導き出すための、説明可能性ツールの新しいセットを提案します。さまざまなFIQA決定でサンプルを処理するときのFRモデルの動作に基づいて分析を行うことにより、ツールの展開を特定のFIQAメソッドに制限することを回避します。これは、顔の埋め込みから派生したネットワークのアクティベーションを示すために、アクティベーションマッピングを使用して、CNNベースのFRソリューションでFIQAメソッドに適用できる説明可能性ツールにつながります。 FRモデルの低品質画像と高品質画像の一般的な空間活性化マッピング間の低い識別を回避するために、さまざまな品質決定を使用した画像セットのFR活性化マップの変動を分析することにより、高微分空間で説明可能性ツールを構築します。 FIQA間およびFIQA内のメソッド分析を提示することにより、ツールを示し、4つのFIQAメソッドの結果を分析します。私たちが提案したツールとそれらに基づく分析は、他の結論の中でもとりわけ、高品質の画像は通常、中央の顔領域の外側の領域で一貫して低い活性化を引き起こし、低品質の画像は一般的に低い活性化にもかかわらず、大きな変動があることを指摘していますそのような領域での活性化の。私たちの説明可能性ツールは、単一の画像の分析にも拡張され、低品質の画像は、FRモデルの空間的活性化が、高品質の画像から予想されるものとは大きく異なる傾向があることを示しています。この違いは、外部の領域でもより多く現れる傾向があります。中央の顔の領域であり、極端なポーズや顔の閉塞などの問題に対応しています。提案されたツールの実装は、ここ[リンク]からアクセスできます。

It is challenging to derive explainability for unsupervised or statistical-based face image quality assessment (FIQA) methods. In this work, we propose a novel set of explainability tools to derive reasoning for different FIQA decisions and their face recognition (FR) performance implications. We avoid limiting the deployment of our tools to certain FIQA methods by basing our analyses on the behavior of FR models when processing samples with different FIQA decisions. This leads to explainability tools that can be applied for any FIQA method with any CNN-based FR solution using activation mapping to exhibit the network's activation derived from the face embedding. To avoid the low discrimination between the general spatial activation mapping of low and high-quality images in FR models, we build our explainability tools in a higher derivative space by analyzing the variation of the FR activation maps of image sets with different quality decisions. We demonstrate our tools and analyze the findings on four FIQA methods, by presenting inter and intra-FIQA method analyses. Our proposed tools and the analyses based on them point out, among other conclusions, that high-quality images typically cause consistent low activation on the areas outside of the central face region, while low-quality images, despite general low activation, have high variations of activation in such areas. Our explainability tools also extend to analyzing single images where we show that low-quality images tend to have an FR model spatial activation that strongly differs from what is expected from a high-quality image where this difference also tends to appear more in areas outside of the central face region and does correspond to issues like extreme poses and facial occlusions. The implementation of the proposed tools is accessible here [link].

updated: Thu Dec 09 2021 10:52:36 GMT+0000 (UTC)

published: Thu Dec 09 2021 10:52:36 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト