Style Normalization and Restitution for DomainGeneralization and Adaptation

Xin Jin; Cuiling Lan; Wenjun Zeng; Zhibo Chen

ドメインの一般化と適応のためのスタイルの正規化と復元

多くの実用的なコンピュータービジョンアプリケーションでは、学習したモデルは通常、トレーニングに使用されるデータセットで高いパフォーマンスを発揮しますが、トレーニング画像とテスト画像の間にスタイルの違いがある新しい環境に展開すると、パフォーマンスが大幅に低下します。効果的なドメイン一般化可能モデルは、一般化可能で識別可能な特徴表現を学習できることが期待されます。この論文では、ネットワークの高い一般化と識別能力の両方を同時に保証するために、新しいスタイルの正規化と復元モジュール（SNR）を設計します。特にSNRモジュールでは、インスタンスの正規化（IN）を実行してスタイルのバリエーション（照明、色のコントラストなど）を除外し、さまざまなサンプルやドメイン間の不一致が軽減されたスタイルの正規化された機能を取得します。ただし、このようなプロセスはタスクを認識せず、必然的にタスクに関連する識別情報が削除され、パフォーマンスが低下する可能性があります。これを改善するために、タスク関連の識別機能を残差（つまり、元の機能とスタイル正規化機能の違い）から抽出し、それらをネットワークに追加して、高い識別を確保することを提案します。さらに、より良い解きほぐしのために、復元ステップで二重の因果関係損失制約を適用して、タスクに関連する機能とタスクに関連しない機能のより良い分離を促進します。分類、セマンティックセグメンテーション、オブジェクト検出など、さまざまなコンピュータビジョンタスクに対するSNRの有効性を検証します。実験により、SNRモジュールは、多くのタスクでドメイン一般化（DG）および教師なしドメイン適応（UDA）のネットワークのパフォーマンスを向上させることができることが実証されています。コードはhttps://github.com/microsoft/SNRで入手できます。

For many practical computer vision applications, the learned models usually have high performance on the datasets used for training but suffer from significant performance degradation when deployed in new environments, where there are usually style differences between the training images and the testing images. An effective domain generalizable model is expected to be able to learn feature representations that are both generalizable and discriminative. In this paper, we design a novel Style Normalization and Restitution module (SNR) to simultaneously ensure both high generalization and discrimination capability of the networks. In the SNR module, particularly, we filter out the style variations (e.g, illumination, color contrast) by performing Instance Normalization (IN) to obtain style normalized features, where the discrepancy among different samples and domains is reduced. However, such a process is task-ignorant and inevitably removes some task-relevant discriminative information, which could hurt the performance. To remedy this, we propose to distill task-relevant discriminative features from the residual (i.e, the difference between the original feature and the style normalized feature) and add them back to the network to ensure high discrimination. Moreover, for better disentanglement, we enforce a dual causality loss constraint in the restitution step to encourage the better separation of task-relevant and task-irrelevant features. We validate the effectiveness of our SNR on different computer vision tasks, including classification, semantic segmentation, and object detection. Experiments demonstrate that our SNR module is capable of improving the performance of networks for domain generalization (DG) and unsupervised domain adaptation (UDA) on many tasks. Code are available at https://github.com/microsoft/SNR.

updated: Sun Jan 03 2021 09:01:39 GMT+0000 (UTC)

published: Sun Jan 03 2021 09:01:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト