A Novel Approach for Text Extraction and Word Segmentation from Handwritten Document Images Using CNN-RNN Technique

Dimpy Singh
Shalini Puri

Abstract

Optical Character Recognition (OCR) takes an image of a character as input and produces the corresponding character as output. Its applications include traffic surveillance, robotics, and the digitization of printed material. OCR is often implemented with Convolutional Neural Networks (CNNs), a widely adopted deep learning architecture. Traditional CNN classifiers learn the salient 2D features of an image and then classify them, typically with a softmax layer. This paper describes optical character recognition using refined versions of CNNs together with a Recurrent Neural Network (RNN) classifier. Recognition quality was assessed using Character Error Rate (CER) and Word Error Rate (WER). Two datasets, IAM and RIMES, were used, each divided into training and testing subsets, and accuracy, precision, and recall were calculated on these splits. The experiments showed that the CNN method achieved notably higher accuracy on both datasets, reaching 89.3% and 86% on IAM and RIMES, respectively.
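
The abstract names Character Error Rate and Word Error Rate but does not give the formulas the authors used. As a minimal sketch, assuming the common definition (Levenshtein edit distance between the recognized text and the reference, divided by the reference length in characters or words), the two metrics could be computed as follows. The function names `edit_distance`, `cer`, and `wer` are illustrative and do not come from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of words)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty sequence costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # substitute r with h, or keep a match
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character Error Rate: character-level edits per reference character."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word Error Rate: word-level edits per reference word (assumes whitespace tokenization)."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)
```

For example, `cer("hello", "hallo")` returns `0.2` (one substitution over five reference characters). Under this definition both rates can exceed 1.0 when the hypothesis contains many insertions.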

Version published to 10.21203/rs.3.rs-7305332/v1 on Research Square
Sep 11, 2025

Related articles

Benchmarking OCR and Vision-Language Models for Turkish Text Recognition: A Comprehensive Evaluation Using Synthetic Data

This article has 4 authors:
1. Yasin Yılmaz
2. Erol Görkem Hanoğlu
3. Ayşe Gül Özkan
4. Kasım Öztoprak
This article has no evaluations. Latest version: Oct 14, 2025

Scene Text Detection Using Attention with Depthwise Separable Convolutions for Mobile Applications

This article has 2 authors:
1. Ramalakshmi Subbukalai
2. Vani Vijayan
This article has no evaluations. Latest version: Sep 11, 2025

A Study on OCR-Based Answer Sheet Evaluation Systems

This article has 7 authors:
1. Chris Mathew Joseph
2. Devika Vinod
3. Dheeraj Krishna
4. Haritha P
5. Josmi Jose
6. Sabeena K
7. Sulaja Sanal
This article has no evaluations. Latest version: Oct 20, 2025
