Distribution-aware Fairness Test Generation

Sai Sathiesh Rajan; Ezekiel Soremekun; Yves Le Traon; Sudipta Chattopadhyay

This work addresses how to validate group fairness in image recognition software. We propose a distribution-aware fairness testing approach (called DistroFair) that systematically exposes class-level fairness violations in image classifiers by combining out-of-distribution (OOD) testing with semantic-preserving image mutation. DistroFair automatically learns the distribution (e.g., number/orientation) of objects in a set of images. It then systematically mutates objects in the images so that they become OOD, using three semantic-preserving image mutations: object deletion, object insertion and object rotation. We evaluate DistroFair using two well-known datasets (CityScapes and MS-COCO) and three major commercial image recognition systems (namely, Amazon Rekognition, Google Cloud Vision and Azure Computer Vision). Results show that about 21% of the images generated by DistroFair reveal class-level fairness violations under either ground-truth or metamorphic oracles. DistroFair is up to 2.3x more effective than two main baselines: (a) an approach that generates images only within the distribution (ID) and (b) fairness analysis using only the original image dataset. We further observed that DistroFair is efficient: it generates 460 images per hour on average. Finally, we evaluate the semantic validity of our approach via a user study with 81 participants, using 30 real images and 30 corresponding mutated images generated by DistroFair. We found that images generated by DistroFair are 80% as realistic as real-world images.
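To make the "learn the distribution, then mutate to OOD" step concrete, here is a minimal sketch for the object-count case only. The abstract does not say how DistroFair models the distribution or where it draws the OOD boundary, so everything here is an assumption made for illustration: the function names (`learn_count_distribution`, `is_ood`, `insertions_to_reach_ood`), the per-class mean and standard deviation model, and the threshold of `k` standard deviations are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from statistics import mean, pstdev


def learn_count_distribution(images):
    """Estimate (mean, std) of the per-image object count for each class.

    `images` is a list of dicts mapping a class name to the number of
    objects of that class in one image. A class missing from an image
    counts as zero. This mean/std model is an assumption, not the
    paper's method.
    """
    classes = {cls for img in images for cls in img}
    counts = defaultdict(list)
    for img in images:
        for cls in classes:
            counts[cls].append(img.get(cls, 0))
    return {cls: (mean(vals), pstdev(vals)) for cls, vals in counts.items()}


def is_ood(dist, cls, count, k=2.0):
    """Treat a count as OOD if it lies more than k std devs from the mean.

    The k-sigma rule is a hypothetical stand-in for an OOD criterion.
    """
    mu, sigma = dist[cls]
    return abs(count - mu) > k * sigma


def insertions_to_reach_ood(dist, cls, current, k=2.0, limit=100):
    """Smallest number of inserted objects that pushes the count OOD.

    Returns None if no count up to `current + limit` is OOD.
    """
    for extra in range(1, limit + 1):
        if is_ood(dist, cls, current + extra, k):
            return extra
    return None
```

A test generator built along these lines would call `insertions_to_reach_ood` for an image and class, apply that many object-insertion mutations, and then compare the classifier's per-class error rates on the mutated images. Deletion and rotation would need analogous rules for lower counts and for orientation.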

updated: Tue Jun 27 2023 08:45:06 GMT+0000 (UTC)

published: Mon May 08 2023 08:38:22 GMT+0000 (UTC)

arXiv
