Learning Detail-Structure Alternative Optimization for Blind Super-Resolution

Feng Li; Yixuan Wu; Huihui Bai; Weisi Lin; Runmin Cong; Yao Zhao

Existing convolutional neural network (CNN) based image super-resolution (SR) methods achieve impressive performance under the bicubic kernel assumption, but that assumption does not hold for the unknown degradations found in real-world applications. Recent blind SR methods reconstruct SR images by relying on blur kernel estimation. However, their results still exhibit visible artifacts and detail distortion caused by estimation errors. To alleviate these problems, we propose an effective kernel-free network, DSSR, which performs recurrent detail-structure alternative optimization for blind SR without incorporating a blur kernel prior. Specifically, DSSR contains a detail-structure modulation module (DSMM) that exploits the interaction and collaboration between image details and structures. The DSMM consists of two components: a detail restoration unit (DRU) and a structure modulation unit (SMU). The former regresses an intermediate high-resolution (HR) detail reconstruction from low-resolution (LR) structural contexts, and the latter modulates the structural contexts conditioned on the learned detail maps in both HR and LR spaces. In addition, we use the output of the DSMM as the hidden state and design the DSSR architecture from a recurrent convolutional neural network (RCNN) view. In this way, the network alternately optimizes the image details and structural contexts, achieving co-optimization across time. Moreover, the recurrent connection lets low- and high-level feature representations complement each other, because DSSR observes the previous HR details and contexts at every unrolling step. Extensive experiments on synthetic datasets and real-world images demonstrate that our method achieves state-of-the-art performance compared with existing methods. The source code can be found at https://github.com/Arcananana/DSSR.
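
The alternating update described in the abstract can be illustrated with a toy sketch. This is not the authors' network: the real DRU and SMU are learned convolutional units, whereas the version below works on 1-D lists with hand-written filters. All function names, the `gain` parameter, and the choice of filters are assumptions made for illustration only. The sketch keeps just the control flow: at each unrolling step a detail map is regressed from the LR structure context, the structure context is then modulated by that detail map in both LR and HR spaces, and the modulated context is carried to the next step as the hidden state.

```python
from typing import List, Tuple

Signal = List[float]


def upsample(x: Signal, scale: int) -> Signal:
    """Nearest-neighbour upsampling of a 1-D signal."""
    return [v for v in x for _ in range(scale)]


def downsample(x: Signal, scale: int) -> Signal:
    """Average-pool a 1-D signal; len(x) must be divisible by scale."""
    return [sum(x[i:i + scale]) / scale for i in range(0, len(x), scale)]


def toy_dru(structure_lr: Signal, scale: int) -> Signal:
    """Toy stand-in for the DRU (assumption): derive an HR detail map from
    the LR structure context as a high-pass residual of its upsampling."""
    up = upsample(structure_lr, scale)
    n = len(up)
    detail = []
    for i in range(n):
        left = up[max(i - 1, 0)]
        right = up[min(i + 1, n - 1)]
        detail.append(up[i] - (left + up[i] + right) / 3.0)
    return detail


def toy_smu(structure_lr: Signal, detail_hr: Signal, scale: int,
            gain: float = 0.5) -> Tuple[Signal, Signal]:
    """Toy stand-in for the SMU (assumption): modulate the structure
    context conditioned on the detail map in LR space, then in HR space."""
    detail_lr = downsample(detail_hr, scale)
    new_lr = [s * (1.0 + gain * d) for s, d in zip(structure_lr, detail_lr)]
    new_hr = [s + gain * d for s, d in zip(upsample(new_lr, scale), detail_hr)]
    return new_lr, new_hr


def toy_dssr_unroll(lr: Signal, scale: int = 2, steps: int = 3) -> Signal:
    """Alternate detail restoration and structure modulation for `steps`
    unrolling steps, carrying the LR structure context as hidden state."""
    structure_lr = list(lr)
    sr = upsample(structure_lr, scale)
    for _ in range(steps):
        detail_hr = toy_dru(structure_lr, scale)
        structure_lr, structure_hr = toy_smu(structure_lr, detail_hr, scale)
        sr = [s + d for s, d in zip(structure_hr, detail_hr)]
    return sr


if __name__ == "__main__":
    print(toy_dssr_unroll([0.0, 1.0, 0.0], scale=2, steps=3))
```

A flat input contains no detail, so the toy loop returns its plain upsampling unchanged; any other input is progressively sharpened around its edges as the two units take turns.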

updated: Sat Dec 03 2022 14:44:17 GMT+0000 (UTC)

published: Sat Dec 03 2022 14:44:17 GMT+0000 (UTC)

arXiv
