TARA: Training and Representation Alteration for AI Fairness and Domain Generalization

William Paul; Armin Hadzic; Neil Joshi; Fady Alajaji; Phil Burlina

We propose a novel method for enforcing AI fairness with respect to protected or sensitive factors. The method uses a dual strategy of training and representation alteration (TARA) to mitigate prominent causes of AI bias. It includes: a) representation learning alteration via adversarial independence, to suppress the bias-inducing dependence of the data representation on protected factors; and b) training set alteration via intelligent augmentation, to address bias-causing data imbalance by using generative models that allow fine control of sensitive factors related to underrepresented populations via domain adaptation and latent space manipulation. When testing our methods on image analytics, experiments demonstrate that TARA significantly or fully debiases baseline models while outperforming competing debiasing methods that have the same amount of information, e.g., with (% overall accuracy, % accuracy gap) = (78.8, 0.5) vs. the baseline method's score of (71.8, 10.5) for EyePACS, and (73.7, 11.8) vs. (69.1, 21.7) for CelebA. Furthermore, recognizing certain limitations in current metrics used for assessing debiasing performance, we propose novel conjunctive debiasing metrics. Our experiments also demonstrate the ability of these novel metrics to assess the Pareto efficiency of the proposed methods.
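As a rough illustration of how a (% overall accuracy, % accuracy gap) pair like those quoted above might be computed, the following is a minimal sketch. It assumes the accuracy gap is the difference between the best and worst per-subgroup accuracy over the protected factor; the abstract does not spell out the definition, so treat that as an assumption. The function name `accuracy_and_gap` and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def accuracy_and_gap(y_true, y_pred, groups):
    """Return (overall accuracy %, accuracy gap %, per-group accuracy %).

    ASSUMPTION: the gap is max minus min per-group accuracy, where groups
    are the values of the protected factor. This is an illustrative
    definition, not one confirmed by the abstract.
    """
    if not (len(y_true) == len(y_pred) == len(groups)):
        raise ValueError("inputs must have the same length")
    if len(y_true) == 0:
        raise ValueError("inputs must be non-empty")

    correct = defaultdict(int)  # correct predictions per group
    total = defaultdict(int)    # samples per group
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)

    overall = 100.0 * sum(correct.values()) / sum(total.values())
    per_group = {g: 100.0 * correct[g] / total[g] for g in total}
    gap = max(per_group.values()) - min(per_group.values())
    return overall, gap, per_group


# Toy data (hypothetical): two subgroups "a" and "b" of a protected factor.
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

overall, gap, per_group = accuracy_and_gap(y_true, y_pred, groups)
print(overall, gap, per_group)
```

Under this definition, a debiasing method is preferable when it raises the first number while shrinking the second, which is the direction of the EyePACS and CelebA comparisons quoted in the abstract.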

updated: Fri Aug 20 2021 14:37:50 GMT+0000 (UTC)

published: Fri Dec 11 2020 14:39:10 GMT+0000 (UTC)

arXiv
