End-to-End Unsupervised Document Image Blind Denoising

Mehrdad J Gangeh; Marcin Plata; Hamid Motahari; Nigel P Duffy

エンドツーエンドの教師なしドキュメント画像のブラインドノイズ除去

スキャンしたページからノイズを除去することは、光学式文字認識（OCR）システムに送信する前の重要なステップです。ノイズの多い/クリーンなページのペアが必要な場合、利用可能なほとんどの画像ノイズ除去方法が監視されます。ただし、実際の設定では、この仮定が満たされることはめったにありません。また、ドキュメントからさまざまな種類のノイズを除去できる単一のモデルはありません。ここでは、統合されたエンドツーエンドの教師なし深層学習モデルを初めて提案します。これにより、塩やコショウのノイズ、ぼやけたテキストや色あせたテキスト、ドキュメントの透かしなど、複数の種類のノイズを効果的に除去できます。さまざまなレベルの強度で。提案されたモデルが、スキャンされた画像の品質といくつかのテストデータセットのページのOCRを大幅に改善することを示します。

Removing noise from scanned pages is a vital step before their submission to optical character recognition (OCR) system. Most available image denoising methods are supervised where the pairs of noisy/clean pages are required. However, this assumption is rarely met in real settings. Besides, there is no single model that can remove various noise types from documents. Here, we propose a unified end-to-end unsupervised deep learning model, for the first time, that can effectively remove multiple types of noise, including salt \& pepper noise, blurred and/or faded text, as well as watermarks from documents at various levels of intensity. We demonstrate that the proposed model significantly improves the quality of scanned images and the OCR of the pages on several test datasets.

updated: Wed May 19 2021 23:55:15 GMT+0000 (UTC)

published: Wed May 19 2021 23:55:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト