Transformer-Based Source-Free Domain Adaptation

Guanglei Yang; Hao Tang; Zhun Zhong; Mingli Ding; Ling Shao; Nicu Sebe; Elisa Ricci

In this paper, we study the task of source-free domain adaptation (SFDA), where the source data are not available during target adaptation. Previous work on SFDA mainly focuses on aligning the cross-domain distributions. However, it ignores the generalization ability of the pretrained source model, which largely determines the initial target outputs that are vital to the target adaptation stage. To address this, we observe that model accuracy is highly correlated with whether or not attention is focused on the objects in an image. Motivated by this observation, we propose a generic and effective Transformer-based framework, named TransDA, for learning a generalized model for SFDA. Specifically, we apply the Transformer as the attention module and inject it into a convolutional network. By doing so, the model is encouraged to turn its attention towards the object regions, which can effectively improve its generalization ability on the target domains. Moreover, we propose a novel self-supervised knowledge distillation approach that adapts the Transformer with target pseudo-labels, further encouraging the network to focus on the object regions. Experiments on three domain adaptation tasks, namely closed-set, partial-set, and open-set adaptation, demonstrate that TransDA can greatly improve adaptation accuracy and produce state-of-the-art results. The source code and trained models are available at https://github.com/ygjwd12345/TransDA.
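
The abstract does not give the form of the distillation objective, so the following is only a generic sketch of how knowledge distillation with target pseudo-labels is commonly set up; it is not the authors' loss. It assumes a teacher model whose outputs supply both a hard pseudo-label (its most confident class) and softened class probabilities, and a student trained on a weighted sum of the two terms. The names `softmax`, `pseudo_label`, and `distillation_loss`, and the parameters `temperature` and `alpha`, are hypothetical and chosen for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def pseudo_label(teacher_logits):
    """Hard pseudo-label: index of the teacher's highest-scoring class."""
    return max(range(len(teacher_logits)), key=lambda i: teacher_logits[i])


def distillation_loss(student_logits, teacher_logits, temperature=2.0, alpha=0.5):
    """Generic distillation objective (an assumption, not the paper's formula).

    Combines cross-entropy against the teacher's hard pseudo-label with the
    KL divergence from the softened teacher distribution to the softened
    student distribution, weighted by alpha.
    """
    # Cross-entropy of the student against the hard pseudo-label.
    p_student = softmax(student_logits)
    hard_ce = -math.log(p_student[pseudo_label(teacher_logits)])

    # KL(teacher || student) on temperature-softened distributions.
    q_teacher = softmax(teacher_logits, temperature)
    q_student = softmax(student_logits, temperature)
    soft_kl = sum(
        t * math.log(t / s) for t, s in zip(q_teacher, q_student) if t > 0
    )

    return alpha * hard_ce + (1 - alpha) * soft_kl
```

Under these assumptions, a student whose logits agree with the teacher's incurs a smaller loss than one that ranks a different class first, and the soft term vanishes when the two sets of logits are identical.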

updated: Fri May 28 2021 23:06:26 GMT+0000 (UTC)

published: Fri May 28 2021 23:06:26 GMT+0000 (UTC)

arXiv
