Image preprocessing and modified adaptive thresholding for improving OCR

Rohan Lal Kshetry

In this paper I propose a method that finds the dominant pixel intensity inside the text and thresholds the image accordingly, making the image easier to use with optical character recognition (OCR) models. Instead of editing the whole image, the method removes all features except the text boundaries and the color filling them. The grayscale intensity of the letters in the input image is used as one of the thresholding parameters. The performance of the developed model is validated on input images, with and without the image processing, followed by OCR with PyTesseract. Based on the results obtained, the algorithm can be applied efficiently to image preprocessing for OCR.
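The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea (estimate the text's gray level, then threshold around it), not the paper's implementation. The function names, the `min_gap` and `tolerance` parameters, and the histogram-mode heuristic for guessing the text intensity are all assumptions introduced for illustration. It also assumes an 8-bit grayscale image in which the background is the most frequent intensity.

```python
import numpy as np


def dominant_text_intensity(gray, min_gap=40):
    """Guess the text's gray level (hypothetical heuristic).

    Assumes the background is the most frequent intensity and the text is
    the most frequent intensity at least `min_gap` levels away from it.
    """
    hist = np.bincount(gray.ravel(), minlength=256)
    background = int(np.argmax(hist))
    levels = np.arange(256)
    # Zero out histogram bins that are too close to the background level.
    candidates = np.where(np.abs(levels - background) >= min_gap, hist, 0)
    return int(np.argmax(candidates))


def threshold_around_text(gray, tolerance=30, min_gap=40):
    """Keep pixels near the estimated text intensity; whiten everything else.

    Returns a uint8 image with text-like pixels set to 0 and the rest to 255.
    """
    text_level = dominant_text_intensity(gray, min_gap)
    # Widen the dtype so the subtraction cannot wrap around in uint8.
    distance = np.abs(gray.astype(np.int16) - text_level)
    return np.where(distance <= tolerance, 0, 255).astype(np.uint8)


if __name__ == "__main__":
    # Synthetic example: light background, dark "text" block, mid-gray clutter.
    image = np.full((20, 20), 200, dtype=np.uint8)
    image[5:10, 5:15] = 50    # stands in for the letters
    image[15, 0:5] = 120      # stands in for a non-text feature
    binary = threshold_around_text(image)
    print(dominant_text_intensity(image), np.unique(binary))
```

In a pipeline like the one the abstract describes, the resulting binary image would then be handed to the OCR step, for example `pytesseract.image_to_string`, and compared against OCR on the unprocessed input.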

updated: Tue Nov 30 2021 04:04:33 GMT+0000 (UTC)

published: Sun Nov 28 2021 08:13:20 GMT+0000 (UTC)

arXiv
