Learning Noise-Resistant Image Representation by Aligning Clean and Noisy Domains

Yanhui Guo; Xiaolin Wu; Fangzhou Luo

クリーンな領域とノイズの多い領域を並べてノイズに強い画像表現を学習する

最近の教師ありおよび教師なし画像表現学習アルゴリズムは、飛躍的な進歩を遂げています。ただし、これらの手法では、設計パラダイムにおけるノイズに対する表現の回復力が考慮されていません。その結果、これらの効果的な方法は、通常はモデルのトレーニングには不透明な複雑な現実世界のノイズなど、トレーニング分布外のノイズに直面すると失敗します。この問題に対処するために、デュアルドメインは、相互作用を最大化することで、ノイズの多い表現の正規空間、つまりノイズロバスト (NR) ドメインと、双子の正規クリーンスペース、つまりノイズフリー (NF) ドメインを個別にモデル化するように最適化されています。表現間の情報。二重正準領域を考慮して、NR 表現を NF 領域に正確に変換するターゲット誘導型の暗黙的ニューラルマッピング関数を設計し、ノイズ領域を排除することでノイズ耐性のある表現を生成します。提案された方法は、既存の学習システムに容易に統合してノイズに対する堅牢性を向上させることができるスケーラブルなモジュールです。合成データと現実世界のノイズを含むデータの両方を使用したさまざまなタスクの包括的なトライアルにより、提案されたターゲット誘導デュアルドメイン変換 (TDDT) 手法が、複雑なノイズを含む画像に直面しても顕著なパフォーマンスと堅牢性を達成できることが実証されました。

Recent supervised and unsupervised image representation learning algorithms have achieved quantum leaps. However, these techniques do not account for representation resilience against noise in their design paradigms. Consequently, these effective methods suffer failure when confronted with noise outside the training distribution, such as complicated real-world noise that is usually opaque to model training. To address this issue, dual domains are optimized to separately model a canonical space for noisy representations, namely the Noise-Robust (NR) domain, and a twinned canonical clean space, namely the Noise-Free (NF) domain, by maximizing the interaction information between the representations. Given the dual canonical domains, we design a target-guided implicit neural mapping function to accurately translate the NR representations to the NF domain, yielding noise-resistant representations by eliminating noise regencies. The proposed method is a scalable module that can be readily integrated into existing learning systems to improve their robustness against noise. Comprehensive trials of various tasks using both synthetic and real-world noisy data demonstrate that the proposed Target-Guided Dual-Domain Translation (TDDT) method is able to achieve remarkable performance and robustness in the face of complex noisy images.

updated: Mon Jul 03 2023 05:38:28 GMT+0000 (UTC)

published: Mon Jul 03 2023 05:38:28 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト