Domain Generalization Using a Mixture of Multiple Latent Domains

Toshihiko Matsuura; Tatsuya Harada

複数の潜在ドメインの混合を使用したドメインの一般化

基礎となるデータ分布を表すドメインがトレーニングおよびテストプロセス中に変化すると、ディープニューラルネットワークのパフォーマンスが低下します。ドメインの一般化により、複数のソースドメインを使用することで、表示されていないターゲットドメインの一般化パフォーマンスを改善できます。従来の方法では、各サンプルが属するドメインがトレーニングで既知であると想定しています。ただし、Webクロールによって収集されたデータセットなど、多くのデータセットには、各サンプルのドメインが不明な複数の潜在ドメインが混在しています。このホワイトペーパーでは、複数の潜在ドメインの混合を使用したドメインの一般化を、より現実的なシナリオとして紹介します。ここでは、ドメインラベルを使用せずにドメイン一般化モデルをトレーニングします。このシナリオに対処するために、クラスタリングを介して反復的にサンプルを潜在ドメインに分割し、敵対学習により分割された潜在ドメイン間で共有されるドメイン不変特徴抽出器をトレーニングする方法を提案します。画像の潜在領域がそのスタイルに反映されていると想定しているため、クラスタリングにスタイル機能を利用しています。これらの機能を使用することで、提案された方法は、潜在的なドメインを発見し、ドメインラベルが与えられていなくてもドメインの一般化を実現します。実験により、提案手法がドメインラベルを使用せずにドメイン一般化モデルをトレーニングできることがわかります。さらに、ドメインラベルを使用する方法など、従来のドメイン一般化方法よりも優れています。

When domains, which represent underlying data distributions, vary during training and testing processes, deep neural networks suffer a drop in their performance. Domain generalization allows improvements in the generalization performance for unseen target domains by using multiple source domains. Conventional methods assume that the domain to which each sample belongs is known in training. However, many datasets, such as those collected via web crawling, contain a mixture of multiple latent domains, in which the domain of each sample is unknown. This paper introduces domain generalization using a mixture of multiple latent domains as a novel and more realistic scenario, where we try to train a domain-generalized model without using domain labels. To address this scenario, we propose a method that iteratively divides samples into latent domains via clustering, and which trains the domain-invariant feature extractor shared among the divided latent domains via adversarial learning. We assume that the latent domain of images is reflected in their style, and thus, utilize style features for clustering. By using these features, our proposed method successfully discovers latent domains and achieves domain generalization even if the domain labels are not given. Experiments show that our proposed method can train a domain-generalized model without using domain labels. Moreover, it outperforms conventional domain generalization methods, including those that utilize domain labels.

updated: Mon Nov 18 2019 14:31:36 GMT+0000 (UTC)

published: Mon Nov 18 2019 14:31:36 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト