Adversarial Semantic Hallucination for Domain Generalized Semantic Segmentation

Gabriel Tjio; Ping Liu; Joey Tianyi Zhou; Rick Siow Mong Goh

ドメインの一般化されたセマンティックセグメンテーションのための敵対的なセマンティックハルシネーション

畳み込みニューラルネットワークは、通常、テスト（ターゲットドメイン）データとトレーニング（ソースドメイン）データの分布が大幅に異なる場合、パフォーマンスが低下します。この問題は、ターゲットドメインデータを使用してソースドメインとターゲットドメインの機能表現を調整することで軽減できますが、プライバシーの問題により、ターゲットドメインデータが利用できない場合があります。したがって、トレーニング中のターゲットドメインデータへのアクセスが制限されているにもかかわらず、一般化する方法が必要です。この作業では、クラス条件付き幻覚モジュールとセマンティックセグメンテーションモジュールを組み合わせた敵対的セマンティック幻覚アプローチ（ASH）を提案します。セグメンテーションのパフォーマンスはクラスによって異なるため、ソースドメイン画像のセグメンテーション確率マップのセマンティック情報からアフィン変換パラメーターを生成するセマンティック条件付きスタイルの幻覚モジュールを設計します。すべてのクラスを平等に扱う以前の適応アプローチとは異なり、ASHはクラスごとの違いを考慮します。セグメンテーションモジュールと幻覚モジュールは敵対的に競合し、幻覚モジュールはセグメンテーションモジュールに挑戦するためにますます「困難な」様式化された画像を生成します。それに応じて、セグメンテーションモジュールは、適切なクラスごとの難易度レベルで生成されたサンプルでトレーニングされるため、向上します。 CityscapesとMapillaryのベンチマークデータセットに関する私たちの結果は、私たちの方法が最先端の作品と競争力があることを示しています。コードはhttps://github.com/gabriel-tjio/ASHで入手できます。

Convolutional neural networks typically perform poorly when the test (target domain) and training (source domain) data have significantly different distributions. While this problem can be mitigated by using the target domain data to align the source and target domain feature representations, the target domain data may be unavailable due to privacy concerns. Consequently, there is a need for methods that generalize well despite restricted access to target domain data during training. In this work, we propose an adversarial semantic hallucination approach (ASH), which combines a class-conditioned hallucination module and a semantic segmentation module. Since the segmentation performance varies across different classes, we design a semantic-conditioned style hallucination module to generate affine transformation parameters from semantic information in the segmentation probability maps of the source domain image. Unlike previous adaptation approaches, which treat all classes equally, ASH considers the class-wise differences. The segmentation module and the hallucination module compete adversarially, with the hallucination module generating increasingly "difficult" stylized images to challenge the segmentation module. In response, the segmentation module improves as it is trained with generated samples at an appropriate class-wise difficulty level. Our results on the Cityscapes and Mapillary benchmark datasets show that our method is competitive with state of the art work. Code is made available at https://github.com/gabriel-tjio/ASH.

updated: Tue Oct 26 2021 14:20:35 GMT+0000 (UTC)

published: Tue Jun 08 2021 07:07:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト