A Comparative Analysis of Deep Learning and Machine Learning Approaches for Spam Identification on Telegram

Shuo Xu
Zhanyi Ding
Zijing Wei
Chao Yang
Yixiang Li
Xuanjie Chen
Hailiang Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Spam on messaging apps like Telegram is a serious threat to user security andexperience. In this paper, we compared several machine learning (ML) and deep learning (DL)models to find the most effective way to detect it. We tested our models on a dataset of 20,348messages. We put classic approaches like Logistic Regression and Tree-based Modelsincluding bagging and boosting against modern neural networks—a GRU and the ALBERTtransformer. The results demonstrate that both GRU and ALBERT were the clear winners. TheALBERT model was the top performer, achieving state-of-the-art results with a weightedF1-score of 0.97 and an AUC of 0.9943. The GRU model also delivered excellent performance,with an F1-score of 0.94. Their real strength was in identifying the tricky minority ‘spam’class. Here, ALBERT reached an F1-score of 0.95, and the GRU model scored 0.90,significantly outperforming the other methods. We used McNemar's test to confirm thesefindings were statistically significant. Ultimately, our study sets a new benchmark for spamdetection. It proves that transformer models can effectively secure messaging platforms usingonly the content of the message itself.

Version published to 10.20944/preprints202510.2167.v1
Oct 28, 2025

A Lightweight, Explainable Spam Detection System with Rüppell’s Fox Optimizer for the Social Media Network X

This article has 3 authors:
1. Haidar AlZeyadi
2. Rıdvan Sert
3. Fecir Duran
This article has no evaluationsLatest version Oct 23, 2025
Ensemble Methods and Emerging Paradigms in Credit Card Fraud Detection: A Comparative Study

This article has 3 authors:
1. Mariana López García
2. Carlos Alberto Ramírez Torres
3. José Hernández
This article has no evaluationsLatest version Sep 23, 2025
Multi-Label Machine Learning Models for Trolling and Cyberbullying Prediction

This article has 5 authors:
1. Adenrele A. Afolorunso
2. Oluwasogo A. Okunade
3. Morufu Olalere
4. Adeyinka O. Abiodun
5. Olawale Surajudeen Adebayo
This article has no evaluationsLatest version Oct 29, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Lightweight, Explainable Spam Detection System with Rüppell’s Fox Optimizer for the Social Media Network X

Ensemble Methods and Emerging Paradigms in Credit Card Fraud Detection: A Comparative Study

Multi-Label Machine Learning Models for Trolling and Cyberbullying Prediction