Detection of Adult Content in Arabic Tweets Using Machine Learning Models

Aram Ibrahim Al-anazi

Abstract

This study evaluates machine learning and deep learning models for detecting adult content in Arabic tweets, a task complicated by linguistic and cultural challenges specific to Arabic. Using a dataset of 33,691 Arabic tweets, we implemented and compared Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Long Short-Term Memory networks (LSTM), and AraBERT. The data were preprocessed by cleaning and tokenization, then split into training, validation, and test sets. Model performance was assessed with accuracy, F1 score, and confusion matrices. AraBERT achieved the highest reported accuracy (100%), demonstrating superior capability in capturing contextual patterns for content classification. CNN and RNN also performed well, with accuracies of 94.27% and 94.22%, respectively, while LSTM reached 88.37%. These findings highlight AraBERT's potential for effective content moderation in Arabic digital spaces, contributing to safer online environments.
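The abstract names accuracy, F1 score, and confusion matrices as its evaluation metrics. As a rough illustration of how these relate, the sketch below derives accuracy and F1 from the four confusion-matrix counts. It is not the author's evaluation code: the binary labelling (1 = adult content, 0 = other), the function names, and the toy label lists are all assumptions made for this example.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary task; `positive` is the target label."""
    counts = Counter()
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    return counts["tp"], counts["fp"], counts["fn"], counts["tn"]


def accuracy_and_f1(y_true, y_pred, positive=1):
    """Compute accuracy and F1 for the positive class from the confusion counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return accuracy, f1


# Toy labels invented for illustration (assumed: 1 = adult content, 0 = other).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]

acc, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={acc:.4f}  f1={f1:.4f}")
```

On imbalanced data the two numbers can diverge noticeably, which is one reason to read a confusion matrix alongside a headline accuracy figure.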

Version published to 10.21203/rs.3.rs-7579505/v1 on Research Square
Sep 17, 2025

Related articles

Vectorization and Sentiment Analysis of Arabizi Text

This article has 4 authors:
1. Noha Youssef
2. Sama Gouda
3. Farida Madkour
4. Mona Ibrahim
This article has no evaluations. Latest version: Jan 19, 2026

Explainable Amharic Emotional Text Classification Using Transfer Learning

This article has 3 authors:
1. Demeke Endalie
2. Yeshimebet Bayu
3. Tesfa Tegegne
This article has no evaluations. Latest version: Jan 13, 2026

ASRD: Development and Validation of a Large-Scale Arabic Semantic Relation Dataset

This article has 6 authors:
1. Randah Alharbi
2. Tarek Helmy
3. Atika Al-Saghyir
4. Safa Aglan
5. Abdulrahman Alosaimy
6. Husni Al-Muhtaseb
This article has no evaluations. Latest version: Dec 10, 2025
