A Dual-Architecture Deep Learning Pipeline for Real-Time High-Accuracy Arabic Sign Language Recognition

Asmaa Youssef
Amira Gaber
Shereen M. El-Metwally

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This research presents a deep learning-based pipeline for Arabic Sign Language (ArSL) recognition to bridge the communication gap for the Deaf and Hard of Hearing community. We propose a robust system that processes both static images and live video streams, translating isolated gestures into corresponding alphabet letters. Our methodology integrates advanced image preprocessing using Google's MediaPipe for hand landmark detection, along with data augmentation. Two classification approaches are developed: a fine-tuned ResNet18 model achieving 98% test accuracy, and an enhanced architecture employing EfficientNet-B2 as a feature extractor combined with a Random Forest classifier, which achieves 99% accuracy on a diverse, participant-rich dataset of 7,856 labelled RGB images. The superior performance of the latter model demonstrates effective feature extraction and generalization. A functional real-time application validates the system's practical utility, offering an accurate and efficient tool for ArSL recognition.

Version published to 10.21203/rs.3.rs-8605046/v1 on Research Square
Feb 4, 2026

Word-level Afan Oromo Sign Language Recognition Using Deep Learning Approach

This article has 2 authors:
1. Solomon Endalu
2. Kula Kakeba
This article has no evaluationsLatest version Mar 25, 2026
A Modified Vision Transformer for Kurdish Cursive RTL Handwritten Text Recognition

This article has 2 authors:
1. Faraedwn M. Salih
2. Abdulbasit K. Al-talabani
This article has no evaluationsLatest version Apr 6, 2026
Research on Lightweight dynamic gesture recognition model driven by Meta-learning under Small Sample conditions

This article has 3 authors:
1. Yaxu Xue
2. Weidi Huang
3. Chunbiao Gan
This article has no evaluationsLatest version Apr 17, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Word-level Afan Oromo Sign Language Recognition Using Deep Learning Approach

A Modified Vision Transformer for Kurdish Cursive RTL Handwritten Text Recognition

Research on Lightweight dynamic gesture recognition model driven by Meta-learning under Small Sample conditions