Causal Balancing for Domain Generalization

Xinyi Wang; Michael Saxon; Jiachen Li; Hongyang Zhang; Kun Zhang; William Yang Wang

While machine learning models rapidly advance the state of the art on various real-world tasks, out-of-domain (OOD) generalization remains a challenging problem given the vulnerability of these models to spurious correlations. We propose a causally motivated balanced mini-batch sampling strategy that transforms the observed training distribution into a balanced distribution free of spurious correlations. We argue that the Bayes optimal classifier trained on such a balanced distribution is minimax optimal across a sufficiently diverse environment space. We also provide an identifiability guarantee for the latent variable model of the proposed underlying data generation process with invariant causal mechanisms, given a sufficient number of training environments. Experiments on three domain generalization datasets show empirically that our balanced mini-batch sampling strategy improves the performance of four established domain generalization baselines compared with random mini-batch sampling.
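The abstract does not spell out the sampling procedure, so the following is only a minimal sketch of the general idea of balancing a mini-batch, not the authors' method. It assumes that a discrete attribute `z` carrying the spurious correlation is directly observed, which the abstract does not state (it refers to a latent variable model instead). The function name `balanced_mini_batch`, the `per_cell` parameter, and the toy data are all hypothetical.

```python
import random
from collections import Counter, defaultdict


def balanced_mini_batch(examples, per_cell, rng):
    """Draw `per_cell` examples from every non-empty (label, attribute) cell.

    `examples` is a list of (x, y, z) tuples. The attribute z is assumed to be
    observed and discrete; this is an assumption of the sketch, not of the paper.
    """
    cells = defaultdict(list)
    for example in examples:
        _, y, z = example
        cells[(y, z)].append(example)

    batch = []
    for key in sorted(cells):
        # Sample with replacement so that rare cells can still fill their quota.
        batch.extend(rng.choices(cells[key], k=per_cell))
    rng.shuffle(batch)
    return batch


# Toy training set in which label y is strongly correlated with attribute z.
train = (
    [(i, 0, "a") for i in range(90)]
    + [(i, 0, "b") for i in range(10)]
    + [(i, 1, "a") for i in range(10)]
    + [(i, 1, "b") for i in range(90)]
)

batch = balanced_mini_batch(train, per_cell=8, rng=random.Random(0))
counts = Counter((y, z) for _, y, z in batch)
print(counts)
```

In the toy data, 90% of the examples with label 0 have attribute "a", whereas each sampled batch contains every (label, attribute) combination equally often, so the label is no longer predictable from the attribute within a batch. The sketch only balances cells that actually occur in the data; a combination with no examples cannot be sampled.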

updated: Tue Jul 19 2022 04:12:34 GMT+0000 (UTC)

published: Fri Jun 10 2022 17:59:11 GMT+0000 (UTC)

arXiv
