Bias Mimicking: A Simple Sampling Approach for Bias Mitigation

Maan Qraitem; Kate Saenko; Bryan A. Plummer

バイアスの模倣: バイアスを軽減するための単純なサンプリングアプローチ

以前の研究では、Visual Recognition データセットは、クラスラベル Y (プログラマーなど) 内のバイアスグループ B (女性など) を過小評価することが多いことが示されています。このデータセットバイアスは、クラスラベルとバイアスグループ (年齢、性別、人種など) の間の疑似相関を学習するモデルにつながる可能性があります。この問題に対処する最新の方法では、アーキテクチャを大幅に変更するか、損失関数を追加する必要があり、さらにハイパーパラメーターを調整する必要があります。別の方法として、クラスの不均衡に関する文献 (例: Undersampling、Upweighting) からのデータサンプリングベースラインは、多くの場合 1 行のコードで実装でき、ハイパーパラメーターを持たないことが多く、安価で効率的なソリューションを提供します。しかし、これらの方法には重大な欠点があります。たとえば、アンダーサンプリングは入力分布のかなりの部分を削除しますが、オーバーサンプリングはサンプルを繰り返し、オーバーフィッティングを引き起こします。これらの欠点に対処するために、新しいクラス条件付きサンプリング方法を導入します: バイアス模倣。この方法は、クラス c のバイアス分布、つまり P_D(B|Y=c) が c^\prime≠c ごとに模倣される場合、Y と B は統計的に独立しているという観察に基づいています。この概念を使用して、BM は、新しいトレーニング手順を通じて、サンプルを繰り返さずにモデルが分布全体にさらされるようにします。その結果、Bias Mimicking は、非サンプリング手法よりもパフォーマンスを維持し、場合によっては改善しながら、少数派グループのサンプリング手法の平均精度を 4 つのベンチマークで 3% 向上させます。コードは https://github.com/mqraitem/Bias-Mimicking にあります。

Prior work has shown that Visual Recognition datasets frequently underrepresent bias groups B (e.g. Female) within class labels Y (e.g. Programmers). This dataset bias can lead to models that learn spurious correlations between class labels and bias groups such as age, gender, or race. Most recent methods that address this problem require significant architectural changes or additional loss functions requiring more hyper-parameter tuning. Alternatively, data sampling baselines from the class imbalance literature (e.g. Undersampling, Upweighting), which can often be implemented in a single line of code and often have no hyperparameters, offer a cheaper and more efficient solution. However, these methods suffer from significant shortcomings. For example, Undersampling drops a significant part of the input distribution while Oversampling repeats samples, causing overfitting. To address these shortcomings, we introduce a new class conditioned sampling method: Bias Mimicking. The method is based on the observation that if a class c bias distribution, i.e. P_D(B|Y=c) is mimicked across every c^\prime≠c, then Y and B are statistically independent. Using this notion, BM, through a novel training procedure, ensures that the model is exposed to the entire distribution without repeating samples. Consequently, Bias Mimicking improves underrepresented groups average accuracy of sampling methods by 3% over four benchmarks while maintaining and sometimes improving performance over non sampling methods. Code can be found in https://github.com/mqraitem/Bias-Mimicking

updated: Mon Dec 05 2022 17:24:56 GMT+0000 (UTC)

published: Fri Sep 30 2022 17:33:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト