HCR-Net: A deep learning based script independent handwritten character recognition network

Vinod Kumar Chauhan; Sukhdeep Singh; Anuj Sharma

HCR-Net: 深層学習ベースのスクリプトに依存しない手書き文字認識ネットワーク

数十年にわたって広く研究されてきたにもかかわらず、手書き文字認識 (HCR) は依然としてパターン認識における困難な学習問題と見なされており、スクリプトに依存しないモデルに関する研究は非常に限られています。これは主に、文字の構造の類似性、異なる手書きスタイル、ノイズの多いデータセット、スクリプトの多様性、手作りの特徴抽出技術に関する従来の研究の焦点、および結果を再現するための公開データセットとコードリポジトリが利用できないためです。一方、深層学習は、HCR を含むパターン認識のさまざまな分野で大きな成功を収めており、エンドツーエンドの学習を提供します。ただし、ディープラーニング技術は計算コストが高く、トレーニングに大量のデータが必要であり、特定のスクリプト専用に開発されています。上記の制限に対処するために、HCR-Net と呼ばれる、スクリプトに依存しない手書き文字認識のための新しい汎用ディープラーニングアーキテクチャを提案しました。 HCR-Net は、HCR の新しい転移学習アプローチに基づいており、事前にトレーニングされたネットワークの特徴抽出レイヤーを部分的に利用します。転移学習と画像増強により、HCR-Net は、より高速で計算効率の高いトレーニング、より優れたパフォーマンスとより優れた一般化を提供し、小さなデータセットで動作できます。 HCR-Net は、バングラ語、パンジャブ語、ヒンディー語、英語、スウェーデン語、ウルドゥー語、ペルシャ語、チベット語、カンナダ語、マラヤーラム語、テルグ語、マラーティー語、ネパール語、アラビア語の 40 の公的に利用可能なデータセットで広範囲に評価され、26 の新しいベンチマーク結果を確立しました。残りのケースで最良の結果を得るために。 HCR-Net は、既存の結果に対して最大 11% のパフォーマンス向上を示し、非常に最初のエポックで最終パフォーマンスの最大 99% を示す高速収束率を達成しました。 HCR-Net は、最先端の転移学習技術を大幅に上回りました...

Despite being studied extensively for a few decades, handwritten character recognition (HCR) is still considered a challenging learning problem in pattern recognition, and there is very limited research on script independent models. This is mainly because of similarity in structure of characters, different handwriting styles, noisy datasets, diversity of scripts, focus of the conventional research on handcrafted feature extraction techniques, and unavailability of public datasets and code-repositories to reproduce the results. On the other hand, deep learning has witnessed huge success in different areas of pattern recognition, including HCR, and provides an end-to-end learning. However, deep learning techniques are computationally expensive, need large amount of data for training and have been developed for specific scripts only. To address the above limitations, we have proposed a novel generic deep learning architecture for script independent handwritten character recognition, called HCR-Net. HCR-Net is based on a novel transfer learning approach for HCR, which partly utilizes feature extraction layers of a pre-trained network. Due to transfer learning and image-augmentation, HCR-Net provides faster and computationally efficient training, better performance and better generalizations, and can work with small datasets. HCR-Net is extensively evaluated on 40 publicly available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages, and established 26 new benchmark results while performed close to the best results in the rest cases. HCR-Net showed performance improvements up to 11% against the existing results and achieved a fast convergence rate showing up to 99% of final performance in the very first epoch. HCR-Net significantly outperformed the state-of-the-art transfer learning techniques...

updated: Tue Jan 31 2023 19:47:50 GMT+0000 (UTC)

published: Sun Aug 15 2021 05:48:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト