Image Counterfactual Sensitivity Analysis for Detecting Unintended Bias

Emily Denton; Ben Hutchinson; Margaret Mitchell; Timnit Gebru; Andrew Zaldivar

意図しないバイアスを検出するための画像の反事実感度分析

顔分析モデルは、認証から監視追跡に至るまで、人々の生活に深刻な影響を与えるアプリケーションでますます使用されています。したがって、顔の分析技術の倫理的な使用を導くのに役立つ、顔の分類器の意図しないバイアスを明らかにすることができる技術を開発することが重要です。この作品は、画像の反事実感度分析と呼ばれるフレームワークを提案します。これは、有名人の顔で訓練された笑顔の属性分類子を分析する際の概念実証として検討します。フレームワークは、反事実を利用して、顔の特性がわずかに変化した場合に分類器の予測がどのように変化するかを調べます。生成的敵対的ネットワークの最近の進歩を活用して、特定の画像特性の制御された操作を可能にする顔画像の現実的な生成モデルを構築します。次に、トレーニングされた分類器の出力に対する特定のプロパティの操作の影響を測定する一連のメトリックを紹介します。経験的に、笑顔の分類器の予測に影響を与える変動のいくつかの異なる要因を見つけます。この概念実証は、生成モデルを活用してバイアスと公平性をきめ細かく分析するための潜在的な方法を示しています。

Facial analysis models are increasingly used in applications that have serious impacts on people's lives, ranging from authentication to surveillance tracking. It is therefore critical to develop techniques that can reveal unintended biases in facial classifiers to help guide the ethical use of facial analysis technology. This work proposes a framework called image counterfactual sensitivity analysis, which we explore as a proof-of-concept in analyzing a smiling attribute classifier trained on faces of celebrities. The framework utilizes counterfactuals to examine how a classifier's prediction changes if a face characteristic slightly changes. We leverage recent advances in generative adversarial networks to build a realistic generative model of face images that affords controlled manipulation of specific image characteristics. We then introduce a set of metrics that measure the effect of manipulating a specific property on the output of the trained classifier. Empirically, we find several different factors of variation that affect the predictions of the smiling classifier. This proof-of-concept demonstrates potential ways generative models can be leveraged for fine-grained analysis of bias and fairness.

updated: Sat Oct 03 2020 21:33:55 GMT+0000 (UTC)

published: Fri Jun 14 2019 23:50:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト