Test-time Adaptation in the Dynamic World with Compound Domain Knowledge Management

Junha Song; Kwanyong Park; InKyu Shin; Sanghyun Woo; Chaoning Zhang; In So Kweon

Pre-training deep recognition models on every potential visual case before deploying a robotic system is infeasible in practice. Test-time adaptation (TTA) therefore allows a model to adapt itself to novel environments and improve its performance during test time (i.e., lifelong adaptation). Several TTA works have shown promising adaptation performance in continuously changing environments. However, our investigation reveals that existing methods are vulnerable to dynamic distributional changes, which often leads to overfitting of TTA models. To address this problem, this paper first presents a robust TTA framework with compound domain knowledge management. Our framework helps the TTA model harvest the knowledge of multiple representative domains (i.e., the compound domain) and conduct TTA based on that compound domain knowledge. In addition, to prevent overfitting of the TTA model, we devise a novel regularization that modulates the adaptation rate using the domain similarity between the source and the current target domain. With the synergy of the proposed framework and regularization, we achieve consistent performance improvements in diverse TTA scenarios, especially under dynamic domain shifts. We demonstrate the generality of our proposals via extensive experiments, including image classification on ImageNet-C and semantic segmentation on GTA5, C-driving, and corrupted Cityscapes datasets.
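
The abstract says the regularization modulates the adaptation rate using the similarity between the source and current target domains, but it does not give the formula. The following is only a minimal sketch of one possible reading, not the authors' method. It assumes that each domain is summarized by a feature-statistics vector, that similarity is measured with cosine similarity, and that a higher similarity permits a larger step. The names `cosine_similarity` and `similarity_scaled_lr` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_scaled_lr(base_lr, source_stats, target_stats):
    """Scale a base learning rate by source/target domain similarity.

    Assumption (not stated in the abstract): the adaptation rate grows with
    similarity, and negative similarities are clipped to zero so that the
    model is not updated on very dissimilar inputs.
    """
    sim = cosine_similarity(source_stats, target_stats)
    return base_lr * max(0.0, sim)
```

In use, `source_stats` would be computed once from source data, `target_stats` from the current test batch, and the returned value would be passed to the optimizer as the step size for that batch. If the intended relationship is the inverse, only the last line of `similarity_scaled_lr` would change.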

updated: Sat Apr 15 2023 04:03:04 GMT+0000 (UTC)

published: Fri Dec 16 2022 09:02:01 GMT+0000 (UTC)

arXiv

