Detecting Flagged Comments by Analyzing User Behavior Features in Online Communities

Juan Tang
Xun Tang
Binbin Ning
Kanghong Ma
Linli Li

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Comments violating community guidelines have long been a challenge in online discussion communities, often disrupting the user experience and overall community health. This study provides a quantitative analysis of such flagged comments within a large online Chinese discussion platform, examining factors like comment deletion rates, sentiment, discussion context influence, user voting behavior, and timing of comment publication. Our dataset comprises 6,900,119 historical comments from 17,226 users, collected and meticulously cleaned for analysis. Feature analysis reveals that users with higher comment deletion rates are more prone to trolling behavior. Additionally, flagged first or root comments are found to increase the likelihood of subsequent flagged comments within a discussion. Flagged comments also tend to attract more negative votes and appear earlier in the discussion. Leveraging these behavioral features, we built a high-accuracy predictive model that achieved an AUC of 99.2% in identifying flagged comments.

Version published to 10.21203/rs.3.rs-6617792/v1 on Research Square
May 30, 2025

Density-Based Clustering for Twitter Sentiment Analysis Using Artificial Intelligence

This article has 3 authors:
1. Maya ALGhafri
2. Imran Khan
3. Abdelhamid Abdessalem
This article has no evaluationsLatest version May 15, 2025
Simple changes to content curation algorithms affect the beliefs people form in a collaborative filtering experiment

This article has 3 authors:
1. Jason W. Burton
2. Stefan Michael Herzog
3. Philipp Lorenz-Spreen
This article has no evaluationsLatest version Jun 13, 2025
Emergent Threat Discovery: Unsupervised Machine Learning for Phishing Campaign Analysis

This article has 2 authors:
1. Muhammad Fahad Zia
2. Sri Harish Kalidass
This article has no evaluationsLatest version May 23, 2025

Listed in

Abstract

Article activity feed

Related articles

Density-Based Clustering for Twitter Sentiment Analysis Using Artificial Intelligence

Simple changes to content curation algorithms affect the beliefs people form in a collaborative filtering experiment

Emergent Threat Discovery: Unsupervised Machine Learning for Phishing Campaign Analysis