Multilingual Text Preprocessing and Classification For The Detection Of Extremism And Radicalization In Social Networks

Khadidja Zerrouki
Nadjia Benblidia

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Due to the lack of strict controls on social networks, extremist groups like ISIS, Al-Qaeda, and white supremacists have taken advantage of these platforms to spread their ideas, distribute harmful content, and recruit new members. The information distributed through these channels is comprehensive, diverse, and conveyed in multiple languages. The study of online extremism and radicalization is a multifaceted and intricate area of research. The efficacy of machine learning, deep learning, and natural language processing (NLP) algorithms has been demonstrated. Although the majority of research in this field focuses on the analysis of data in a single language, there needs to be more studies on the analysis of multilingual data, specifically about detecting extremism in multilingual material. This research paper introduces the fabrication of an artificial intelligence system that leverages multilingual text posts from social networks to identify instances of extremism and radicalization. We utilize natural language processing (NLP) linguistic methods and text classification to identify extremism and radicalization in text data. Our study results are outstanding. The Bi-LSTM (Bidirectional et al.) model demonstrates a binary classification accuracy of 97.33%, and the multiclass classification accuracy of the Transformer-based model, which employs the DistilBERT-multi (Distilled version of the Multilingual Bidirectional Encoder Representations from Transformers) pre-trained model, is 91.07%. The findings above show significant progress in multiclass multilingual text classification and the detection of extremism and radicalization within social networks.

Version published to 10.21203/rs.3.rs-4218788/v1 on Research Square
Apr 19, 2024

Detection and Analysis of Offensive Online Content in Hausa Language

This article has 3 authors:
1. Fatima Muhammad Adam
2. Abubakar Yakubu Zandam
3. Isa Inuwa-Dutse
This article has no evaluationsLatest version Apr 26, 2024
Fake news detection in the Hindi Language using Multi-modality via Transfer and Ensemble learning

This article has 2 authors:
1. Sonal Garg
2. Dilip Kumar Sharma
This article has no evaluationsLatest version Mar 16, 2024
CDN: Collaborate Decision Network Based on Cross-modal Attention for Social Media Microblog Recognition

This article has 3 authors:
1. Yuxiang Peng
2. Bingxiang Li
3. Jie Fang
This article has no evaluationsLatest version Apr 18, 2024

Listed in

Abstract

Article activity feed

Related articles

Detection and Analysis of Offensive Online Content in Hausa Language

Fake news detection in the Hindi Language using Multi-modality via Transfer and Ensemble learning

CDN: Collaborate Decision Network Based on Cross-modal Attention for Social Media Microblog Recognition