Improved Test-Time Adaptation for Domain Generalization

Liang Chen; Yong Zhang; Yibing Song; Ying Shan; Lingqiao Liu

The main challenge in domain generalization (DG) is handling the distribution shift between training and test data. Recent studies suggest that test-time training (TTT), which adapts the learned model using test data, might be a promising solution. In general, the performance of a TTT strategy hinges on two factors: selecting an appropriate auxiliary TTT task for updating, and identifying reliable parameters to update during the test phase. Both prior work and our experiments indicate that, if these two factors are not properly considered, TTT may fail to improve the learned model and can even be detrimental to it. This work addresses both factors by proposing an Improved Test-Time Adaptation (ITTA) method. First, instead of heuristically defining an auxiliary objective, we propose a learnable consistency loss for the TTT task, which contains learnable parameters that can be adjusted toward better alignment between the TTT task and the main prediction task. Second, we introduce additional adaptive parameters for the trained model and suggest updating only these adaptive parameters during the test phase. Through extensive experiments, we show that the two proposed strategies are beneficial for the learned model (see Figure 1), and that ITTA can achieve superior performance to current state-of-the-art methods on several DG benchmarks. Code is available at https://github.com/liangchen527/ITTA.
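As a rough illustration of the second idea (updating only the adaptive parameters at test time by minimizing a consistency loss between two views of a test sample), a toy sketch in plain Python might look like the following. This is not the authors' implementation: the names `features`, `consistency_loss`, `adapt_step`, `frozen_w`, `shift`, and `loss_w` are hypothetical, the tanh-shift parametrization is an assumption made for brevity, and the loss weights `loss_w` are simply fixed here, whereas the abstract describes them as learned during training.

```python
import math

def features(x, frozen_w, shift):
    """Frozen linear layer, then tanh with an adaptive per-feature input shift."""
    z = []
    for row, s in zip(frozen_w, shift):
        pre = sum(w * xi for w, xi in zip(row, x))
        z.append(math.tanh(pre + s))
    return z

def consistency_loss(z_a, z_b, loss_w):
    """Weighted squared difference between the features of two views."""
    return sum(c * (a - b) ** 2 for c, a, b in zip(loss_w, z_a, z_b))

def adapt_step(x, x_aug, frozen_w, shift, loss_w, lr):
    """One gradient step on the adaptive shifts only; frozen_w is never modified."""
    z_a = features(x, frozen_w, shift)
    z_b = features(x_aug, frozen_w, shift)
    new_shift = []
    for k, s in enumerate(shift):
        diff = z_a[k] - z_b[k]
        # d tanh(u + s) / ds = 1 - tanh(u + s)^2, applied to both views
        grad = 2.0 * loss_w[k] * diff * ((1.0 - z_a[k] ** 2) - (1.0 - z_b[k] ** 2))
        new_shift.append(s - lr * grad)
    return new_shift

# Hypothetical toy values: a "trained" backbone and fixed consistency-loss weights.
frozen_w = [[0.5, -0.2], [0.1, 0.3]]
loss_w = [1.0, 0.5]

# One test sample and a perturbed view of it (the perturbation is an assumption).
x = [1.0, 2.0]
x_aug = [1.2, 1.8]

shift = [0.0, 0.0]  # adaptive parameters, the only values updated at test time
loss_before = consistency_loss(
    features(x, frozen_w, shift), features(x_aug, frozen_w, shift), loss_w
)
for _ in range(50):
    shift = adapt_step(x, x_aug, frozen_w, shift, loss_w, lr=1.0)
loss_after = consistency_loss(
    features(x, frozen_w, shift), features(x_aug, frozen_w, shift), loss_w
)
```

In this toy setting the consistency loss falls simply because the shifts push the tanh units toward saturation, which says nothing about prediction quality. That is one way to see why the choice of auxiliary objective matters, and why the abstract argues for aligning it with the main prediction task.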

updated: Mon Apr 10 2023 10:12:38 GMT+0000 (UTC)

published: Mon Apr 10 2023 10:12:38 GMT+0000 (UTC)

arXiv
