Fake News Detection Through LLM-Driven Text Augmentation Across Media and Languages

Abdul Sittar
Mateja Smiljanic
Alenka Guček
Marko Grobelnik

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The proliferation of fake news across social media, headlines, and news articles poses major challenges for automated detection, particularly in multilingual and cross-media settings affected by data imbalance. We propose a fake news detection framework based on LLM-driven, feature-guided text augmentation. The method generates realistic synthetic samples across languages, media types, and text granularities while preserving factual structure and stylistic coherence. Experiments with classical and transformer-based models (Random Forest, Logistic Regression, BERT, XLM-R) across social media, headline, and multilingual news datasets show consistent improvements in performance. LLM-based augmentation improves overall accuracy by up to 1.6% over imbalanced baselines and increases minority-class F1-scores by up to 2.4% in low-resource languages such as Swahili. Hybrid fact- and style-based models achieve up to 93.8% accuracy with more balanced class-wise F1-scores and reduced language-related disparities, demonstrating improved robustness and cross-lingual generalization.

Version published to 10.20944/preprints202603.0360.v1
Mar 5, 2026

From Generation to Detection: Leveraging Empirically Derived Linguistic Hints for LLM-Based Fake News Detection

This article has 1 author:
1. Piyush Ghasiya
This article has no evaluationsLatest version Jan 28, 2026
Multimodal and Multilingual Fake News Detection using MuRIL and Vision Transformers with Explainable AI

This article has 5 authors:
1. Vedaksha M
2. Tanmay Vinayak
3. Abhisikta Maitra
4. Tarun R
5. Deepamala N
This article has no evaluationsLatest version Feb 4, 2026
Comprehensive Smart Data Augmentation withMultiple Transformer Models for ImbalancedSentiment Classification

This article has 5 authors:
1. Prashant Upadhyaya
2. G. L. Saini
3. Ashish Dangi
4. Subodh Bansal
5. Anupam Vyas
This article has no evaluationsLatest version Feb 6, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

From Generation to Detection: Leveraging Empirically Derived Linguistic Hints for LLM-Based Fake News Detection

Multimodal and Multilingual Fake News Detection using MuRIL and Vision Transformers with Explainable AI

Comprehensive Smart Data Augmentation withMultiple Transformer Models for ImbalancedSentiment Classification