Dense U-net for super-resolution with shuffle pooling layer

Zhengyang Lu; Ying Chen

シャッフルプーリングレイヤーを備えた超解像のための高密度Uネット

最近の研究は、コンピュータビジョンの分野での深層学習の開発により、単一画像超解像（SISR）で大きな進歩を遂げました。これらの方法では、特徴抽出の前に、単一のフィルター（通常は最大プーリング）を使用して、高解像度の入力画像が低解像度の空間にダウンスケールされます。これは、特徴抽出がバイアスされたフィルター処理された特徴空間で実行されることを意味します。これは最適ではなく、情報の損失を引き起こすことを示しています。この作業では、シャッフルプーリングを使用したDenseU-netと呼ばれる最先端の畳み込みニューラルネットワーク手法を提案しました。これを実現するために、高密度U-netと呼ばれる高密度ブロックを備えた修正U-netがSISRに提案されています。次に、シャッフルプーリングと呼ばれる新しいプーリング戦略が設計されます。これは、小規模な運用のために高密度のU-Netを置き換えることを目的としています。そうすることで、SISRパイプラインの手作りのフィルターを、各機能マップ用に特別にトレーニングされたより損失の多いダウンサンプリングフィルターに効果的に置き換えると同時に、SISR操作全体の情報損失を削減します。さらに、平均二乗誤差（MSE）、構造類似性指数（SSIM）、平均勾配誤差（MGE）を組み合わせた混合損失関数が登場し、知覚損失と高レベルの情報損失を低減します。私たちが提案する方法は、SET14、BSD300、ICDAR2003の3つのベンチマークデータセットで、以前の最先端技術よりも優れた精度を実現します。コードはオンラインで入手できます。

Recent researches have achieved great progress on single image super-resolution(SISR) due to the development of deep learning in the field of computer vision. In these method, the high resolution input image is down-scaled to low resolution space using a single filter, commonly max-pooling, before feature extraction. This means that the feature extraction is performed in biased filtered feature space. We demonstrate that this is sub-optimal and causes information loss. In this work, we proposed a state-of-the-art convolutional neural network method called Dense U-net with shuffle pooling. To achieve this, a modified U-net with dense blocks, called dense U-net, is proposed for SISR. Then, a new pooling strategy called shuffle pooling is designed, which is aimed to replace the dense U-Net for down-scale operation. By doing so, we effectively replace the handcrafted filter in the SISR pipeline with more lossy down-sampling filters specifically trained for each feature map, whilst also reducing the information loss of the overall SISR operation. In addition, a mix loss function, which combined with Mean Square Error(MSE), Structural Similarity Index(SSIM) and Mean Gradient Error (MGE), comes up to reduce the perception loss and high-level information loss. Our proposed method achieves superior accuracy over previous state-of-the-art on the three benchmark datasets: SET14, BSD300, ICDAR2003. Code is available online.

updated: Sat Jan 09 2021 05:58:08 GMT+0000 (UTC)

published: Wed Nov 11 2020 00:59:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト