A Data-Driven Approach to Supporting Fact-Checking and Mitigating Mis/Disinformation Through Domain Quality Evaluation

Kaveh Kadkhoda Mohammadmosaferi
Anna Bertani
Thomas Louf
Riccardo Gallotti

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Misinformation and disinformation spread rapidly on social media, threatening public discourse, democratic processes, and social cohesion. One promising strategy to address these challenges is to evaluate the trustworthiness of entire domains (source websites) as a proxy for content credibility. This approach demands methods that are both scalable and data-driven. However, current solutions like NewsGuard and MBFC rely on expert assessments, cover only a limited number of domains, and often require paid subscriptions. These constraints limit their usefulness for large-scale research.This study introduces a machine-learning-based system designed to assess the quality and trustworthiness of websites. We propose a data-driven approach that leverages a large dataset of expert-rated domains to predict credibility scores for previously unseen domains using domain-level features. Our supervised regression model achieves moderate performance, with a mean absolute error of 0.12. Using feature importance analysis, we found that PageRank-based features provided the greatest reduction in prediction error, confirming that link-based indicators play a central role in domain trustworthiness. This highlights the importance of highly referenced domains in reliable news dissemination. This approach can also help fact-checkers and social media platforms refine their credibility assessment strategies.The solution’s scalable design accommodates the continuously evolving nature of online content, ensuring that evaluations remain timely and relevant. The framework enables continuous assessment of thousands of domains with minimal manual effort. This capability allows stakeholders (social media platforms, media monitoring organizations, content moderators, and researchers) to allocate resources more efficiently, prioritize verification efforts, and reduce exposure to questionable sources. Ultimately, this facilitates a more proactive and effective response to misinformation while also supporting robust public discourse and informed decision-making.

Version published to 10.31219/osf.io/eqf26_v1 on OSF Preprints
Jul 21, 2025

A Hybrid Metadata-Intelligent Framework for Fake News Detection, Ranking, and Web Preservation

This article has 7 authors:
1. Muhammad Faisal Abrar
2. Muhammad Saqib
3. Ali Alferaidi
4. Tariq S. Almuraziq
5. Raza Uddin
6. Wilayat Khan
7. Zawar Khan
This article has no evaluationsLatest version Aug 11, 2025
A Multidimensional Assessment Approach for Knowledge Credibility in Domain-Specific Knowledge Graph

This article has 5 authors:
1. Yin Li
2. Li Liao
3. Ying Zhou
4. Lulu Wang
5. Bixin Li
This article has no evaluationsLatest version Sep 2, 2025
MIND-SBERT: An Explainable and Trustworthy Article Retrieval and Summarization System

This article has 1 author:
1. Ahmad Farooq
This article has no evaluationsLatest version Jul 31, 2025

Listed in

Abstract

Article activity feed

Related articles

A Hybrid Metadata-Intelligent Framework for Fake News Detection, Ranking, and Web Preservation

A Multidimensional Assessment Approach for Knowledge Credibility in Domain-Specific Knowledge Graph

MIND-SBERT: An Explainable and Trustworthy Article Retrieval and Summarization System