GujaratiHCR: A Hybrid Deep Learning Approach to Handwritten Character Recognition of Gujarati Language

Mehulkumar Dalwadi
Abhishek Mehta

Abstract

Handwritten Character Recognition (HCR) for low-resource languages such as Gujarati remains difficult because of the script's intricate character shapes and the wide variation in writing styles. This work presents GujaratiHCR, a hybrid deep learning model that aims to recognize handwritten Gujarati text both accurately and with linguistic coherence. The system begins with a preprocessing phase: grayscale conversion, Adaptive Histogram Equalization (AHE) to enhance contrast, Non-Local Means (NLM) filtering to remove noise, and morphological cleanup to eliminate artifacts. This is followed by a text segmentation and line detection module based on Canny edge detection with contour-based methods, and by character segmentation through a new combination of the Watershed Transform and a U-Net-based deep learning model. The core recognition module employs a character-level CNN-LSTM-Transformer hybrid network, complemented by n-gram feature extraction and a BERT-based linguistic correction mechanism that improves the coherence of the recognized text. After recognition, the system normalizes the output by converting it to Unicode and tokenizes it at the syllable and word level. Additional linguistic processing applies Part-of-Speech (POS) tagging and Named Entity Recognition (NER) to determine grammatical structure and significant entities for downstream tasks such as speech synthesis. Experimental results on accuracy, F-measure, PSNR, SSIM, and BLEU score indicate that GujaratiHCR outperforms other available models, namely CNN, DCNN, CNN-LSTM, and CapsNet-LSTM, and offers a holistic solution for precise, context-aware recognition of handwritten Gujarati text.
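The abstract does not specify how the post-recognition normalization and syllable-level tokenization are implemented. As a rough illustration only, the sketch below assumes NFC normalization and a simplified rule for grouping Gujarati code points into syllable clusters: combining marks attach to the preceding character, and a consonant that follows a virama joins the preceding conjunct. The function names `normalize`, `split_syllables`, and `tokenize` are hypothetical and are not taken from the paper.

```python
import unicodedata

VIRAMA = "\u0acd"  # GUJARATI SIGN VIRAMA (halant), which links consonants into conjuncts


def normalize(text: str) -> str:
    """Bring recognized text to a canonical Unicode form (NFC is an assumption)."""
    return unicodedata.normalize("NFC", text)


def split_syllables(word: str) -> list:
    """Group the code points of one word into approximate syllable clusters.

    Simplified rule (an assumption, not the paper's method):
    - a combining mark (Unicode category M*) attaches to the current cluster;
    - a character that follows a virama also attaches, forming a conjunct;
    - any other character starts a new cluster.
    Zero-width joiners and other special cases are not handled.
    """
    clusters = []
    for ch in word:
        is_mark = unicodedata.category(ch).startswith("M")
        joins_conjunct = bool(clusters) and clusters[-1].endswith(VIRAMA)
        if clusters and (is_mark or joins_conjunct):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


def tokenize(text: str) -> list:
    """Split text into whitespace-delimited words, then each word into syllable clusters."""
    return [split_syllables(word) for word in normalize(text).split()]


if __name__ == "__main__":
    # The word "Gujarati" written in Gujarati script.
    print(tokenize("\u0a97\u0ac1\u0a9c\u0ab0\u0abe\u0aa4\u0ac0"))
```

Under this rule the word for "Gujarati" splits into four clusters (ગુ, જ, રા, તી). A production tokenizer would more likely follow the grapheme cluster rules of the Unicode text segmentation standard than this two-condition heuristic.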

Version published to 10.21203/rs.3.rs-7338523/v1 on Research Square
Aug 20, 2025

Related articles

A Novel Approach for Text Extraction and Word Segmentation from Handwritten Document Images Using CNN-RNN Technique

This article has 2 authors:
1. Dimpy Singh
2. Shalini Puri
This article has no evaluations.
Latest version: Sep 11, 2025

Deep learning-based approach for identifying writers' gender using Sinhala handwritten text

This article has 2 authors:
1. R.M.P.M. Ramanayake
2. W.A.C. Weerakoon
This article has no evaluations.
Latest version: Aug 12, 2025

Deep Learning-Based Recognition of Miao Ethnic Costumes via YOLOv5s: A Step Toward Digital Cultural Preservation

This article has 3 authors:
1. Ting Chen
2. Yan Hong
3. Xiaoqun Dai
This article has no evaluations.
Latest version: Aug 13, 2025
