Image Detection and Data extraction Using Hybrid Deep Learning Techniques

R V Raghavendra Rao
Ch. Ram Mohan Reddy
Vishruth AC
Prajwal P K

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In the current age of data, numerous pictures are everywhere that permit the extraction of text and certain image-based information. There are several technologies/ tools that can be used to accomplish this task. Optical Character Recognition (OCR) is a vital technology to automate text extraction from pictures, specifically for identifying people through Identification cards. This paper introduces a hybrid system that integrates few conventional OCR utilities such as PyTesseract with deep learning algorithms, including Mask Region-based Convolutional Neural Networks (R-CNN) for object detection and Convolutional Recurrent Neural Network (CRNN) for text recognition. The system also boosts the text extraction with the application of sophisticated preprocessing techniques such as noise removal, binarization, and edge detection, which enhance image quality and recognition accuracy. After the text is extracted, the text extracted is well-arranged and stored in an Excel file to make it convenient to store and retrieve. The system is compared with the general traditional OCR systems, and that the system demonstrates improvements in accuracy rate, speed of processing, and error correction, Also even under difficult conditions such as low-resolution images and varying lighting. The suggested system is ideal for verification of identities in the majority of the sectors like banking, government, and education. The future developments will involve support for multi-languages and compatibility with mobile devices so that the system becomes even more efficient and versatile with user-friendly.

Version published to 10.21203/rs.3.rs-7065509/v1 on Research Square
Jul 24, 2025

A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

This article has 3 authors:
1. Fahim Al Islam
2. Saif Hossain
3. Monir Hosen
This article has no evaluationsLatest version Feb 3, 2026
Enhancing Medical Anomaly Detection via Text-Adapted Few-Shot Learning with Visual-Language Models

This article has 5 authors:
1. Keming Mao
2. Shengbin Hou
3. Haoming Fang
4. Jianzhe Zhao
5. Xinlu Xiao
This article has no evaluationsLatest version Jan 12, 2026
Radiographic Pneumonia Detection and Multiclass Classification Using Deep Learning Models

This article has 4 authors:
1. Ayenew Walle Kebede
2. Melesew Mossie Beyene
3. Addisu Taye Tamene
4. Amare Gedif Yalew
This article has no evaluationsLatest version Feb 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

Enhancing Medical Anomaly Detection via Text-Adapted Few-Shot Learning with Visual-Language Models

Radiographic Pneumonia Detection and Multiclass Classification Using Deep Learning Models