Fast and High-Quality Image Denoising via Malleable Convolutions

Yifan Jiang; Bartlomiej Wronski; Ben Mildenhall; Jonathan T. Barron; Zhangyang Wang; Tianfan Xue

順応性畳み込みによる高速で高品質の画像ノイズ除去

ほとんどの画像ノイズ除去ネットワークは、静的畳み込みカーネルの単一セットを入力画像全体に適用します。これは、多くの場合、異種の視覚パターンで構成されているため、自然画像には最適ではありません。動的畳み込みは、ピクセルごとの畳み込みカーネルを使用してこの問題に対処しようとしますが、これにより計算コストが大幅に増加します。この作業では、最小限の計算オーバーヘッドで空間可変処理を実行する可鍛性畳み込み (MalleConv) を紹介します。 MalleConv は、静的な畳み込みカーネルとピクセルごとの畳み込みカーネルの間の妥協点である、空間的に変化する畳み込みカーネルのより小さなセットを使用します。これらの空間的に変化するカーネルは、ダウンサンプリングされた入力で実行される効率的な予測ネットワークによって生成されるため、フル解像度の画像によって生成されるピクセルごとのカーネルよりもはるかに効率的に計算でき、静的カーネルと比較してネットワークの受容野も拡大します。次に、これらのカーネルはまとめてアップサンプリングされ、最小限のメモリオーバーヘッドで効率的なオンザフライスライシングオペレーターを介してフル解像度のフィーチャマップに適用されます。 MalleConv の有効性を実証するために、これを使用して MalleNet と呼ばれる効率的なノイズ除去ネットワークを構築します。 MalleNet は、非常に深いアーキテクチャを使用せずに高品質の結果を達成し、最高の性能を発揮するノイズ除去アルゴリズムよりも 8.9 倍速く、同様の視覚的品質を実現します。また、標準の畳み込みベースのバックボーンに単一の MalleConv レイヤーを追加すると、計算コストを大幅に削減するか、同様のコストで画質を向上させることができることも示しています。詳細については、プロジェクトページをご覧ください: https://yifanjiang.net/MalleConv.html

Most image denoising networks apply a single set of static convolutional kernels across the entire input image. This is sub-optimal for natural images, as they often consist of heterogeneous visual patterns. Dynamic convolution tries to address this issue by using per-pixel convolution kernels, but this greatly increases computational cost. In this work, we present Malleable Convolution (MalleConv), which performs spatial-varying processing with minimal computational overhead. MalleConv uses a smaller set of spatially-varying convolution kernels, a compromise between static and per-pixel convolution kernels. These spatially-varying kernels are produced by an efficient predictor network running on a downsampled input, making them much more efficient to compute than per-pixel kernels produced by a full-resolution image, and also enlarging the network's receptive field compared with static kernels. These kernels are then jointly upsampled and applied to a full-resolution feature map through an efficient on-the-fly slicing operator with minimum memory overhead. To demonstrate the effectiveness of MalleConv, we use it to build an efficient denoising network we call MalleNet. MalleNet achieves high-quality results without very deep architectures, making it 8.9x faster than the best performing denoising algorithms while achieving similar visual quality. We also show that a single MalleConv layer added to a standard convolution-based backbone can significantly reduce the computational cost or boost image quality at a similar cost. More information is on our project page: https://yifanjiang.net/MalleConv.html

updated: Tue Aug 09 2022 00:47:24 GMT+0000 (UTC)

published: Sun Jan 02 2022 18:35:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト