Style Normalization and Restitution for Domain Generalization and Adaptation

Xin Jin; Cuiling Lan; Wenjun Zeng; Zhibo Chen

ドメインの一般化と適応のためのスタイルの正規化と復元

多くの実用的なコンピュータービジョンアプリケーションでは、学習されたモデルは通常、トレーニングに使用されるデータセットで高いパフォーマンスを発揮しますが、新しい環境に展開すると、通常、トレーニングイメージとテストイメージの間にスタイルの違いがある場合、パフォーマンスが大幅に低下します。効果的なドメイン汎化可能モデルは、汎化可能かつ識別可能な特徴表現を学習できることが期待されます。この論文では、新しいスタイルの正規化と復元モジュール (SNR) を設計して、ネットワークの高い一般化と識別能力の両方を同時に確保します。特に、SNR モジュールでは、インスタンス正規化 (IN) を実行してスタイルの変動 (照明、色のコントラストなど) を除外して、スタイルの正規化機能を取得し、異なるサンプルやドメイン間の不一致を減らします。ただし、このようなプロセスはタスクを認識せず、必然的にタスクに関連する識別情報を削除するため、パフォーマンスが低下する可能性があります。これを改善するために、タスク関連の識別機能を残差 (つまり、元の機能とスタイル正規化機能との違い) から抽出し、ネットワークに追加して高い識別を確保することを提案します。さらに、より良い絡み合いを解消するために、回復ステップで二重の因果関係の損失制約を適用して、タスクに関連する機能とタスクに関係のない機能のより適切な分離を促進します。分類、セマンティックセグメンテーション、オブジェクト検出など、さまざまなコンピュータービジョンタスクでの SNR の有効性を検証します。実験により、当社の SNR モジュールが、多くのタスクでドメイン汎化 (DG) および教師なしドメイン適応 (UDA) のネットワークのパフォーマンスを改善できることが示されています。コードは https://github.com/microsoft/SNR で入手できます。

For many practical computer vision applications, the learned models usually have high performance on the datasets used for training but suffer from significant performance degradation when deployed in new environments, where there are usually style differences between the training images and the testing images. An effective domain generalizable model is expected to be able to learn feature representations that are both generalizable and discriminative. In this paper, we design a novel Style Normalization and Restitution module (SNR) to simultaneously ensure both high generalization and discrimination capability of the networks. In the SNR module, particularly, we filter out the style variations (e.g, illumination, color contrast) by performing Instance Normalization (IN) to obtain style normalized features, where the discrepancy among different samples and domains is reduced. However, such a process is task-ignorant and inevitably removes some task-relevant discriminative information, which could hurt the performance. To remedy this, we propose to distill task-relevant discriminative features from the residual (i.e, the difference between the original feature and the style normalized feature) and add them back to the network to ensure high discrimination. Moreover, for better disentanglement, we enforce a dual causality loss constraint in the restitution step to encourage the better separation of task-relevant and task-irrelevant features. We validate the effectiveness of our SNR on different computer vision tasks, including classification, semantic segmentation, and object detection. Experiments demonstrate that our SNR module is capable of improving the performance of networks for domain generalization (DG) and unsupervised domain adaptation (UDA) on many tasks. Code are available at https://github.com/microsoft/SNR.

updated: Fri Mar 11 2022 03:15:04 GMT+0000 (UTC)

published: Sun Jan 03 2021 09:01:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト