Dynamic Instance Domain Adaptation

Zhongying Deng; Kaiyang Zhou; Da Li; Junjun He; Yi-Zhe Song; Tao Xiang

Most existing studies on unsupervised domain adaptation (UDA) assume that each domain's training samples come with domain labels (e.g., painting, photo). Samples from each domain are assumed to follow the same distribution, and the domain labels are exploited to learn domain-invariant features via feature alignment. However, this assumption often does not hold: a single labeled domain frequently contains numerous finer-grained domains (e.g., dozens of modern painting styles have been developed, each differing dramatically from the classic styles). Forcing feature distribution alignment across artificially defined, coarse-grained domains can therefore be ineffective. In this paper, we address both single-source and multi-source UDA from a completely different perspective: we view each instance as a fine-grained domain of its own. Feature alignment across domains is thus redundant. Instead, we propose to perform dynamic instance domain adaptation (DIDA). Concretely, we develop a dynamic neural network with adaptive convolutional kernels that generates instance-adaptive residuals, which adapt domain-agnostic deep features to each individual instance. This enables a shared classifier to be applied to both source and target domain data without relying on any domain annotation. Further, instead of imposing intricate feature alignment losses, we adopt a simple semi-supervised learning paradigm that uses only a cross-entropy loss on both labeled source data and pseudo-labeled target data. Our model, dubbed DIDA-Net, achieves state-of-the-art performance on several commonly used single-source and multi-source UDA datasets, including Digits, Office-Home, DomainNet, Digit-Five, and PACS.
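The two ideas in the abstract, instance-adaptive residuals and a cross-entropy-only objective on labeled source and pseudo-labeled target data, can be illustrated with a small sketch. This is not the authors' DIDA-Net implementation. It assumes that a feature is a plain vector, that the "adaptive kernel" is a per-instance softmax mixture of a few basis matrices (standing in for convolutional kernels), and that pseudo-labels are kept only above a confidence threshold. The names `gate`, `kernels`, `threshold`, and all function names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def instance_adaptive_features(feature, gate, kernels):
    """Return feature + residual, with the residual produced by a kernel
    that is mixed per instance (assumed simplification of a dynamic conv)."""
    # Mixing weights depend on the instance itself; `gate` is a K x dim matrix.
    weights = softmax(matvec(gate, feature))
    dim = len(feature)
    # Blend the K basis kernels into one instance-specific kernel.
    mixed = [
        [sum(w * k[i][j] for w, k in zip(weights, kernels)) for j in range(dim)]
        for i in range(dim)
    ]
    residual = matvec(mixed, feature)
    return [f + r for f, r in zip(feature, residual)]


def cross_entropy(probs, label):
    """Cross-entropy of a predicted distribution against a hard label."""
    return -math.log(probs[label])


def uda_loss(source_batch, target_probs, threshold=0.95):
    """Mean cross-entropy over labeled source samples and confidently
    pseudo-labeled target samples (the threshold is an assumption).

    source_batch: non-empty list of (probs, label) pairs.
    target_probs: list of predicted distributions for unlabeled targets.
    """
    losses = [cross_entropy(p, y) for p, y in source_batch]
    for p in target_probs:
        confidence = max(p)
        if confidence >= threshold:
            pseudo_label = p.index(confidence)  # hard pseudo-label
            losses.append(cross_entropy(p, pseudo_label))
    return sum(losses) / len(losses)
```

Note that neither function takes a domain label: the residual depends only on the instance, and the loss distinguishes samples only by whether a ground-truth label is available.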

updated: Wed Mar 09 2022 20:05:54 GMT+0000 (UTC)

published: Wed Mar 09 2022 20:05:54 GMT+0000 (UTC)

arXiv
