Video-Based Arabic Sign Language Recognition with Mediapipe and Deep Learning Techniques

Dana El-Rushaidat
Nour Almohammad
Raine Yeh
Kinda Fayyad

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This paper addresses the critical communication barrier experienced by deaf and hearing-impaired individuals in the Arab world through the development of an affordable, video-based Arabic Sign Language (ArSL) recognition system. Designed for broad accessibility, the system eliminates specialized hardware by leveraging standard mobile or laptop cameras. Our methodology employs Mediapipe for real-time extraction of hand, face, and pose landmarks from video streams. These anatomical features are then processed by a hybrid deep learning model integrating Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), specifically Bidirectional Long Short-Term Memory (BiLSTM) layers. The CNN component captures spatial features, such as intricate hand shapes and body movements, within individual frames. Concurrently, BiLSTMs model long-term temporal dependencies and motion trajectories across consecutive frames. This integrated CNN-BiLSTM architecture is critical for generating a comprehensive spatiotemporal representation, enabling accurate differentiation of complex signs where meaning relies on both static gestures and dynamic transitions, thus preventing misclassification that CNN-only or RNN-only models would incur. Rigorously evaluated on the author-created JUST-SL dataset and the publicly available KArSL dataset, the system achieved 96% overall accuracy for JUST-SL and an impressive 99% for KArSL. These results demonstrate the system’s superior accuracy compared to previous research, particularly for recognizing full Arabic words, thereby significantly enhancing communication accessibility for the deaf and hearing-impaired community.

Version published to 10.20944/preprints202603.0913.v1
Mar 12, 2026

A Dual-Architecture Deep Learning Pipeline for Real-Time High-Accuracy Arabic Sign Language Recognition

This article has 3 authors:
1. Asmaa Youssef
2. Amira Gaber
3. Shereen M. El-Metwally
This article has no evaluationsLatest version Feb 4, 2026
Edge-Optimized AI-Powered Translator for Indian Sign Language (ISL)

This article has 6 authors:
1. R E Nischal
2. P Koti Darshan
3. Surya Narayan M
4. K A Ramita
5. P Shalini
6. Vijayalakshmi M N
This article has no evaluationsLatest version Mar 18, 2026
Word-level Afan Oromo Sign Language Recognition Using Deep Learning Approach

This article has 2 authors:
1. Solomon Endalu
2. Kula Kakeba
This article has no evaluationsLatest version Mar 25, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Dual-Architecture Deep Learning Pipeline for Real-Time High-Accuracy Arabic Sign Language Recognition

Edge-Optimized AI-Powered Translator for Indian Sign Language (ISL)

Word-level Afan Oromo Sign Language Recognition Using Deep Learning Approach