Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization

Kowshik Thopalli; Sameeksha Katoch; Pavan Turaga; Jayaraman J. Thiagarajan

ゼロショット一般化を改善するための複数のソースからの自動ドメイン検出

ドメイン一般化 (DG) メソッドは、テスト分布がトレーニングデータとは異なる設定に一般化するモデルを開発することを目的としています。このホワイトペーパーでは、マルチソースゼロショット DG (MDG) の困難な問題に焦点を当てます。MDG では、複数のソースドメインからのラベル付きトレーニングデータを利用できますが、ターゲットドメインからのデータにはアクセスできません。最先端のマルチドメインアンサンブルアプローチを含む、この問題に対して幅広いソリューションが提案されています。これらの進歩にもかかわらず、すべてのソースデータをまとめてプールし、単一の分類子をトレーニングする単純な ERM ソリューションは、標準的なベンチマークで驚くほど効果的です。この論文では、この動作を説明するために、事前に指定されたドメインラベルと MDG パフォーマンスとの間のリンクを解明することが重要であるという仮説を立てています。より具体的には、MDG アルゴリズムの 2 つの一般的なクラス (分散ロバスト最適化 (DRO) とマルチドメインアンサンブル) を検討して、カスタムドメイングループを推測することで、データセットに付属する元のドメインラベルを一貫して改善できることを示します。この目的のために、(i) Group-DRO++ を提案します。これには、既存の DRO 手法でカスタムドメインを識別するための明示的なクラスタリング手順が組み込まれています。（ii）新しいメタ最適化アルゴリズムを使用した暗黙的なドメインの再ラベル付けにより、効果的なマルチドメインアンサンブルを生成するDReaME。複数の標準的なベンチマークに関する実証研究を使用して、当社のバリアントが一貫して ERM を大幅に上回り (1.5% ～ 9%)、最先端の MDG パフォーマンスを生み出すことを示しています。コードは https://github.com/kowshikthopalli/DREAME にあります。

Domain generalization (DG) methods aim to develop models that generalize to settings where the test distribution is different from the training data. In this paper, we focus on the challenging problem of multi-source zero shot DG (MDG), where labeled training data from multiple source domains is available but with no access to data from the target domain. A wide range of solutions have been proposed for this problem, including the state-of-the-art multi-domain ensembling approaches. Despite these advances, the naïve ERM solution of pooling all source data together and training a single classifier is surprisingly effective on standard benchmarks. In this paper, we hypothesize that, it is important to elucidate the link between pre-specified domain labels and MDG performance, in order to explain this behavior. More specifically, we consider two popular classes of MDG algorithms -- distributional robust optimization (DRO) and multi-domain ensembles, in order to demonstrate how inferring custom domain groups can lead to consistent improvements over the original domain labels that come with the dataset. To this end, we propose (i) Group-DRO++, which incorporates an explicit clustering step to identify custom domains in an existing DRO technique; and (ii) DReaME, which produces effective multi-domain ensembles through implicit domain re-labeling with a novel meta-optimization algorithm. Using empirical studies on multiple standard benchmarks, we show that our variants consistently outperform ERM by significant margins (1.5% - 9%), and produce state-of-the-art MDG performance. Our code can be found at https://github.com/kowshikthopalli/DREAME

updated: Fri Nov 04 2022 00:35:38 GMT+0000 (UTC)

published: Fri Dec 17 2021 23:21:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト