Weighted Risk Invariance: Domain Generalization under Invariant Feature Shift

Gina Wong; Joshua Gleason; Rama Chellappa; Yoav Wald; Anqi Liu

Learning models whose predictions are invariant under multiple environments is a promising approach for out-of-distribution generalization. Such models are trained to extract features X_inv where the conditional distribution Y \mid X_inv of the label given the extracted features does not change across environments. Invariant models are also supposed to generalize to shifts in the marginal distribution p(X_inv) of the extracted features X_inv, a type of shift we call an invariant covariate shift. However, we show that proposed methods for learning invariant models underperform under invariant covariate shift, either failing to learn invariant modelsx2014even for data generated from simple and well-studied linear-Gaussian modelsx2014or having poor finite-sample performance. To alleviate these problems, we propose weighted risk invariance (WRI). Our framework is based on imposing invariance of the loss across environments subject to appropriate reweightings of the training examples. We show that WRI provably learns invariant models, i.e. discards spurious correlations, in linear-Gaussian settings. We propose a practical algorithm to implement WRI by learning the density p(X_inv) and the model parameters simultaneously, and we demonstrate empirically that WRI outperforms previous invariant learning methods under invariant covariate shift.

updated: Thu Jul 25 2024 23:27:10 GMT+0000 (UTC)

published: Thu Jul 25 2024 23:27:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト