Who Falls into the Echo Chamber? Identifying Risk Features of Social Media Users via Explainable Machine Learning

Yichang Gao
Peita L. Richards
Lei Gao
Aneta Neumann
Yingping Sun
Juliette Tobias-Webb

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Echo chambers are increasingly recognized as a critical challenge in online information environments where algorithmic filtering and user preferences limit exposure to diverse viewpoints. While existing studies have primarily relied on content- or network-based methods to detect echo chambers, few have focused on identifying echo chambers using machine learning methods and few have looked at the user risk features that predict participation in an echo-chamber. This study proposes a machine learning framework to predict echo chamber membership based on user features drawn from two representative platforms, Weibo and Twitter (X). We utilized a published dataset covering discussions on the Omicron variant and the Tokyo 2020 Olympics, incorporating 20 user features, including 15 general behavioral features and 5 Big Five personality traits. Classification models were developed using three advanced algorithms: Categorical Boosting (CatBoost), Light Gradient Boosting Machine (LightGBM), and Extreme Gradient Boosting (XGBoost). Feature contributions were interpreted using Shapley Additive Explanations (SHAP) and CatBoost was the more efficient (or effective or fast). Results revealed that strong emotional polarity, frequent social interaction, and high neuroticism were key user risk features associated with echo chamber participation across both platforms. The results matter because they move beyond traditional content or network-based explanations of echo chambers, shedding light on the individual level psychological and behavioral traits that make users more vulnerable to these environments. This user-centric perspective allows for a deeper understanding of who is at risk and why, which is critical for designing interventions that don’t just target what people see, but how and why they engage with information in the first place.

Version published to 10.31235/osf.io/a9p36_v1 on OSF Preprints
Feb 9, 2026

Who Falls Into the Trap of Misleading AI-Generated Videos

This article has 4 authors:
1. Yichang Gao
2. Lei Gao
3. Paul Harrigan
4. Fengming Liu
This article has no evaluationsLatest version Mar 4, 2026
Spatiotemporal Analysis of Electronic Cigarette Perception on Twitter/X Using Natural Language Processing

This article has 3 authors:
1. Zidian Xie
2. Jiamu Tang
3. Dongmei Li
This article has no evaluationsLatest version Mar 17, 2026
A Comparative Study of Unsupervised Machine Learning and Deep Learning Techniques for Anomaly Detection in Recommender Systems

This article has 4 authors:
1. Rodolfo Bojorque
2. Remigio Hurtado
3. Miguel Arcos-Argudo
4. Mauricio Ortiz
This article has no evaluationsLatest version Feb 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Who Falls Into the Trap of Misleading AI-Generated Videos

Spatiotemporal Analysis of Electronic Cigarette Perception on Twitter/X Using Natural Language Processing

A Comparative Study of Unsupervised Machine Learning and Deep Learning Techniques for Anomaly Detection in Recommender Systems