Multi-Outputs Is All You Need For Deblur

Sidun Liu; Peng Qiao; Yong Dou

ブレ除去に必要なのはマルチ出力だけ

画像のブレ除去タスクは不適切な設定であり、ぼやけた画像に対して無限の実行可能なソリューションが存在します。最新の深層学習アプローチは通常、ぼかしカーネルの学習を破棄し、エンドツーエンドの教師あり学習を直接採用します。一般的なブレ除去データセットは、ラベルを実行可能なソリューションの 1 つとして定義しています。ただし、特にラベルがランダムな分布からサンプリングされている場合は、ラベルを直接指定することは合理的ではないと主張します。したがって、ネットワークに実行可能な解の分布を学習させ、この考慮事項に基づいて、分布学習のための新しいマルチヘッド出力アーキテクチャと対応する損失関数を設計することを提案します。私たちのアプローチは、モデルが複数の実行可能なソリューションを出力して、ターゲット分布を近似できるようにします。さらに、パフォーマンスを向上させながらパラメータの数と計算量を削減する新しいパラメータ多重化方法を提案します。現在の最先端のNAFNetを含む、複数の画像ブレ除去モデルでアプローチを評価しました。ベストオーバーオール（検証画像ごとに複数のヘッドの中から最高スコアを選択）の改善 PSNR は、比較されたベースラインを 0.11 ～ 0.18dB まで上回ります。最良の単一ヘッドの改善 (検証セットの複数のヘッドの中から最も性能の高いヘッドを選択) PSNR は、比較されたベースラインを最大 0.04 ~ 0.08dB 上回っています。コードは https://github.com/Liu-SD/multi-output-deblur で入手できます。

Image deblurring task is an ill-posed one, where exists infinite feasible solutions for blurry image. Modern deep learning approaches usually discard the learning of blur kernels and directly employ end-to-end supervised learning. Popular deblurring datasets define the label as one of the feasible solutions. However, we argue that it's not reasonable to specify a label directly, especially when the label is sampled from a random distribution. Therefore, we propose to make the network learn the distribution of feasible solutions, and design based on this consideration a novel multi-head output architecture and corresponding loss function for distribution learning. Our approach enables the model to output multiple feasible solutions to approximate the target distribution. We further propose a novel parameter multiplexing method that reduces the number of parameters and computational effort while improving performance. We evaluated our approach on multiple image-deblur models, including the current state-of-the-art NAFNet. The improvement of best overall (pick the highest score among multiple heads for each validation image) PSNR outperforms the compared baselines up to 0.11~0.18dB. The improvement of the best single head (pick the best-performed head among multiple heads on validation set) PSNR outperforms the compared baselines up to 0.04~0.08dB. The codes are available at https://github.com/Liu-SD/multi-output-deblur.

updated: Sat Aug 27 2022 14:44:18 GMT+0000 (UTC)

published: Sat Aug 27 2022 14:44:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト