HCR-Net: A deep learning based script independent handwritten character recognition network

Vinod Kumar Chauhan; Sukhdeep Singh; Anuj Sharma

Despite being studied extensively for decades, handwritten character recognition (HCR) is still considered a challenging learning problem in pattern recognition, and there is very limited research on script-independent models. This is mainly because of the diversity of scripts, the focus of conventional research on handcrafted feature extraction techniques, and the unavailability of public datasets and code to reproduce results. Deep learning, on the other hand, has had huge success in different areas of pattern recognition, including HCR, and provides end-to-end learning, but it has been studied for specific scripts only. In this paper, we propose HCR-Net, a novel deep learning architecture that exploits transfer learning and image augmentation for end-to-end learning of script-independent handwritten character recognition. HCR-Net is based on a novel transfer learning approach for HCR, in which some of the lower layers of a pre-trained network are reused. Thanks to transfer learning and image augmentation, HCR-Net provides faster training, better performance and better generalization, and can reach up to 99% of its final accuracy in just the first epoch. Experimental results on publicly available datasets of the Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages demonstrate the efficacy of HCR-Net and establish several new benchmarks. For reproducibility of the results and to advance HCR research, the complete code is publicly released at https://github.com/jmdvinodjmd/HCR-Net.
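The two ingredients named in the abstract, reusing some lower layers of a pre-trained network and augmenting the training images, can be illustrated with a small dependency-free Python sketch. It is not the authors' implementation: the abstract does not say which pre-trained network, how many layers, or which augmentations are used, so the layer names, the `n_reused` parameter, the choice to freeze the reused layers, and the shift-based augmentation below are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class Layer:
    """Hypothetical stand-in for a network layer (name plus a trainable flag)."""
    name: str
    trainable: bool = True


def build_transfer_model(pretrained, n_reused, new_head):
    """Reuse the first `n_reused` layers of a pre-trained stack and append a new head.

    Assumption: the reused lower layers are frozen; the new head is trainable.
    """
    reused = [Layer(layer.name, trainable=False) for layer in pretrained[:n_reused]]
    head = [Layer(name, trainable=True) for name in new_head]
    return reused + head


def shift_image(image, dx, dy, fill=0):
    """Translate a 2D pixel grid by (dx, dy); vacated cells are set to `fill`."""
    height, width = len(image), len(image[0])
    shifted = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            new_y, new_x = y + dy, x + dx
            if 0 <= new_y < height and 0 <= new_x < width:
                shifted[new_y][new_x] = image[y][x]
    return shifted


def augment(image, max_shift=1, rng=None):
    """Return a randomly shifted copy of `image` (one simple form of augmentation)."""
    rng = rng or random.Random()
    dx = rng.randint(-max_shift, max_shift)
    dy = rng.randint(-max_shift, max_shift)
    return shift_image(image, dx, dy)


if __name__ == "__main__":
    # Hypothetical five-layer pre-trained stack; keep the lowest three layers.
    pretrained = [Layer(f"conv{i}") for i in range(1, 6)]
    model = build_transfer_model(pretrained, n_reused=3, new_head=["dense", "softmax"])
    for layer in model:
        print(layer.name, "trainable" if layer.trainable else "frozen")

    glyph = [[0, 1, 0],
             [0, 1, 0],
             [0, 0, 0]]
    print(augment(glyph, rng=random.Random(0)))
```

The intuition behind this design is that lower layers tend to capture generic strokes and edges shared across scripts, so only the new upper layers need to learn script-specific classes, while random shifts expose the model to small variations in how a character is positioned.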

updated: Thu Apr 14 2022 09:49:49 GMT+0000 (UTC)

published: Sun Aug 15 2021 05:48:07 GMT+0000 (UTC)

arXiv
