Overwriting Pretrained Bias with Finetuning Data

Angelina Wang; Olga Russakovsky

Transfer learning is beneficial because it allows the expressive features of models pretrained on large-scale datasets to be finetuned for a target task on smaller, more domain-specific datasets. However, there is a concern that these pretrained models may come with their own biases, which would propagate into the finetuned model. In this work, we investigate bias conceptualized both as spurious correlations between the target task and a sensitive attribute and as underrepresentation of a particular group in the dataset. Under both notions of bias, we find that (1) models finetuned on top of pretrained models can indeed inherit their biases, but (2) this bias can be corrected through relatively minor interventions to the finetuning dataset, often with negligible impact on performance. Our findings imply that careful curation of the finetuning dataset is important for reducing biases on a downstream task, and that doing so can even compensate for bias in the pretrained model.
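
To make the two notions of bias concrete, here is a minimal sketch of one possible intervention on a finetuning dataset: subsampling so that every (target label, sensitive attribute) pair is equally represented, which removes both the label-attribute correlation and group underrepresentation in the subsample. This is an illustrative assumption, not the procedure used in the paper; the function names `cell_counts` and `balance_cells` and the `(example_id, label, attribute)` tuple layout are hypothetical.

```python
import random
from collections import Counter, defaultdict


def cell_counts(samples):
    """Count examples per (target label, sensitive attribute) pair.

    `samples` is a list of (example_id, label, attribute) tuples
    (a hypothetical layout chosen for this sketch).
    """
    return Counter((label, attr) for _, label, attr in samples)


def balance_cells(samples, seed=0):
    """Subsample so every observed (label, attribute) cell has equal size.

    Cells that never occur in `samples` cannot be balanced this way;
    they would need additional data collection instead.
    """
    rng = random.Random(seed)
    cells = defaultdict(list)
    for example_id, label, attr in samples:
        cells[(label, attr)].append((example_id, label, attr))

    # Every cell is cut down to the size of the smallest observed cell.
    target = min(len(members) for members in cells.values())

    balanced = []
    for key in sorted(cells):
        balanced.extend(rng.sample(cells[key], target))
    return balanced


if __name__ == "__main__":
    # Toy data: label 1 co-occurs mostly with group "a", label 0 with "b".
    toy = (
        [(i, 1, "a") for i in range(40)]
        + [(40 + i, 1, "b") for i in range(10)]
        + [(50 + i, 0, "a") for i in range(10)]
        + [(60 + i, 0, "b") for i in range(40)]
    )
    print("before:", dict(cell_counts(toy)))
    print("after: ", dict(cell_counts(balance_cells(toy))))
```

Full balancing discards data and is only one option; reweighting examples or partially shifting the cell proportions are alternatives that trade off bias reduction against dataset size.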

updated: Thu Aug 17 2023 02:01:07 GMT+0000 (UTC)

published: Fri Mar 10 2023 19:10:58 GMT+0000 (UTC)

arXiv
