Adaptive Domain Generalization via Online Disagreement Minimization

Xin Zhang; Ying-Cong Chen

オンライン不一致最小化による適応領域汎化

ディープニューラルネットワークは、デプロイとトレーニングの間で分布のシフトが存在する場合、パフォーマンスが大幅に低下します。 Domain Generalization (DG) は、一連のソースドメインのみに依存することで、目に見えないターゲットドメインにモデルを安全に転送することを目的としています。さまざまな DG アプローチが提案されていますが、DomainBed という名前の最近の研究では、それらのほとんどが単純な経験的リスク最小化 (ERM) に勝てないことが明らかになりました。この目的のために、既存の DG アルゴリズムに直交し、そのパフォーマンスを一貫して改善できる一般的なフレームワークを提案します。静的ソースモデルが普遍的なものになることを期待する以前の DG 作業とは異なり、提案された AdaODM は、さまざまなターゲットドメインのテスト時にソースモデルを適応的に変更します。具体的には、共有ドメインジェネリック特徴抽出器に複数のドメイン固有分類子を作成します。特徴抽出器と分類器は敵対的な方法でトレーニングされます。特徴抽出器は入力サンプルをドメイン不変空間に埋め込み、複数の分類器はそれぞれが特定のソースドメインに関連する明確な決定境界をキャプチャします。テスト中、ソース分類子間の予測の不一致を活用することで、ターゲットドメインとソースドメイン間の分布の違いを効果的に測定できます。ソースモデルを微調整してテスト時の不一致を最小限に抑えることにより、ターゲットドメインの機能は不変の機能空間に適切に配置されます。 ERM と CORAL という 2 つの一般的な DG メソッドと、VLCS、PACS、OfficeHome、TerraIncognita という 4 つの DG ベンチマークで AdaODM を検証します。結果は、AdaODM が目に見えないドメインの一般化能力を安定して改善し、最先端のパフォーマンスを達成することを示しています。

Deep neural networks suffer from significant performance deterioration when there exists distribution shift between deployment and training. Domain Generalization (DG) aims to safely transfer a model to unseen target domains by only relying on a set of source domains. Although various DG approaches have been proposed, a recent study named DomainBed, reveals that most of them do not beat the simple Empirical Risk Minimization (ERM). To this end, we propose a general framework that is orthogonal to existing DG algorithms and could improve their performance consistently. Unlike previous DG works that stake on a static source model to be hopefully a universal one, our proposed AdaODM adaptively modifies the source model at test time for different target domains. Specifically, we create multiple domain-specific classifiers upon a shared domain-generic feature extractor. The feature extractor and classifiers are trained in an adversarial way, where the feature extractor embeds the input samples into a domain-invariant space, and the multiple classifiers capture the distinct decision boundaries that each of them relates to a specific source domain. During testing, distribution differences between target and source domains could be effectively measured by leveraging prediction disagreement among source classifiers. By fine-tuning source models to minimize the disagreement at test time, target domain features are well aligned to the invariant feature space. We verify AdaODM on two popular DG methods, namely ERM and CORAL, and four DG benchmarks, namely VLCS, PACS, OfficeHome, and TerraIncognita. The results show AdaODM stably improves the generalization capacity on unseen domains and achieves state-of-the-art performance.

updated: Sun Jul 09 2023 13:23:14 GMT+0000 (UTC)

published: Wed Aug 03 2022 11:51:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト