Debiasing Vision-Language Models via Biased Prompts

Ching-Yao Chuang; Varun Jampani; Yuanzhen Li; Antonio Torralba; Stefanie Jegelka

Machine learning models have been shown to inherit biases from their training datasets, which can be particularly problematic for vision-language foundation models trained on uncurated datasets scraped from the internet. The biases can be amplified and propagated to downstream applications like zero-shot classifiers and text-to-image generative models. In this study, we propose a general approach for debiasing vision-language foundation models by projecting out biased directions in the text embedding. In particular, we show that debiasing only the text embedding with a calibrated projection matrix suffices to yield robust classifiers and fair generative models. The closed-form solution enables easy integration into large-scale pipelines, and empirical results demonstrate that our approach effectively reduces social bias and spurious correlation in both discriminative and generative vision-language models without the need for additional data or training.
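The core operation named in the abstract, projecting out biased directions in the text embedding, can be illustrated with a plain orthogonal projection. The following is a minimal sketch, not the authors' implementation: the function name `projection_removing`, the random stand-in embeddings, and the dimensions are all assumptions made for illustration, and the calibration step that the abstract mentions is not reproduced here.

```python
import numpy as np


def projection_removing(directions: np.ndarray) -> np.ndarray:
    """Return P = I - A A^+, where the columns of A span the bias directions.

    `directions` has shape (k, d): k bias directions in a d-dimensional
    embedding space. A A^+ is the orthogonal projector onto their span,
    so P maps any vector to its component orthogonal to that span.
    """
    A = np.asarray(directions, dtype=float).T  # shape (d, k)
    d = A.shape[0]
    return np.eye(d) - A @ np.linalg.pinv(A)


# Hypothetical data: random vectors stand in for real text embeddings.
rng = np.random.default_rng(0)
bias_dirs = rng.normal(size=(2, 8))   # e.g. embeddings of biased prompts (assumed)
text_emb = rng.normal(size=(5, 8))    # e.g. embeddings of class prompts (assumed)

P = projection_removing(bias_dirs)
debiased = text_emb @ P.T             # apply P to each row

# After projection, the embeddings have no component along the bias directions.
residual = np.abs(debiased @ bias_dirs.T).max()
```

Because the projection is a single matrix computed in closed form, it can be applied to text embeddings once, without touching the image encoder or retraining anything, which is consistent with the abstract's claim that no additional data or training is needed.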

updated: Tue Jan 31 2023 20:09:33 GMT+0000 (UTC)

published: Tue Jan 31 2023 20:09:33 GMT+0000 (UTC)

arXiv
