Rethinking Generative Methods for Image Restoration in Physics-based Vision: A Theoretical Analysis from the Perspective of Information

Xudong Kang; Haoran Xie; Jing Qin; Man-Leung Wong

物理ベースの視覚における画像復元のための生成手法の再考: 情報の観点からの理論的分析

エンドツーエンドの生成手法は、手作りのコンポジションモデルに基づく従来の脱構築手法と比較して、物理ベースのビジョンにおける画像復元のためのより有望なソリューションと見なされます。ただし、既存の生成メソッドには、定量的なパフォーマンスを改善する余地がまだたくさんあります。さらに重要なことに、これらの方法は解釈可能性が弱いためブラックボックスと見なされており、そのメカニズムと学習プロセスを説明しようとする理論はほとんどありません.この研究では、情報理論を使用して画像復元タスクのこれらの生成方法を再解釈しようとします。従来の理解とは異なり、これらの方法の情報の流れを分析し、3つの情報源（抽出された高レベルの情報、保持された低レベルの情報、およびソース入力から欠落している外部情報）が関与し、それぞれが生成に最適化されていることを特定しました。復元結果です。さらに、情報のボトルネック原理を拡張することにより、彼らの学習行動、最適化の目的、および対応する情報の境界を導き出しました。この理論的枠組みに基づいて、多くの既存の生成方法は、従来の生成タスク用に設計された一般モデルの直接的な適用になる傾向があることがわかりました。これは、過度に投資された抽象化プロセス、固有の詳細の損失、勾配の消失または不均衡などの問題に悩まされる可能性があります。トレーニング。これらの問題を直感的説明と理論的説明の両方で分析し、それぞれ経験的証拠で証明しました。最終的に、上記の問題に対処するための一般的なソリューションまたはアイデアを提案し、これらのアプローチを検証して、3 つの異なる画像復元タスクの 6 つのデータセットでパフォーマンスを向上させました。

End-to-end generative methods are considered a more promising solution for image restoration in physics-based vision compared with the traditional deconstructive methods based on handcrafted composition models. However, existing generative methods still have plenty of room for improvement in quantitative performance. More crucially, these methods are considered black boxes due to weak interpretability and there is rarely a theory trying to explain their mechanism and learning process. In this study, we try to re-interpret these generative methods for image restoration tasks using information theory. Different from conventional understanding, we analyzed the information flow of these methods and identified three sources of information (extracted high-level information, retained low-level information, and external information that is absent from the source inputs) are involved and optimized respectively in generating the restoration results. We further derived their learning behaviors, optimization objectives, and the corresponding information boundaries by extending the information bottleneck principle. Based on this theoretic framework, we found that many existing generative methods tend to be direct applications of the general models designed for conventional generation tasks, which may suffer from problems including over-invested abstraction processes, inherent details loss, and vanishing gradients or imbalance in training. We analyzed these issues with both intuitive and theoretical explanations and proved them with empirical evidence respectively. Ultimately, we proposed general solutions or ideas to address the above issue and validated these approaches with performance boosts on six datasets of three different image restoration tasks.

updated: Mon Dec 05 2022 12:16:27 GMT+0000 (UTC)

published: Mon Dec 05 2022 12:16:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト