More Than a Model: The Compounding Impact of Behavioral Ambiguity and Task Complexity on Hate Speech Detection
Abstract
The automated detection of hate speech is a critical but difficult task: its subjective, behavior-driven nature leads to frequent annotator disagreement. While advanced models (e.g., transformers) are state-of-the-art, it is unclear how their performance is affected by the methodological choice of label aggregation (e.g., ‘majority vote’ vs. ‘unanimous agreement’) and by task complexity. We conduct a 2×2 quasi-experimental study to measure the compounding impact of two factors: Labeling Strategy (low-ambiguity ‘Pure’ data vs. high-ambiguity ‘Majority’ data) and Task Granularity (Binary vs. Multi-class). We evaluate five models (Logistic Regression, Random Forest, LightGBM, GRU, and ALBERT) across four quadrants derived from the HateXplain dataset. We find that (1) ALBERT is the top-performing model in all conditions, achieving its peak F1-Score (0.8165) on the ‘Pure’ multi-class task; (2) label ambiguity is the primary driver of performance loss, with ALBERT’s F1-Score dropping by ≈15.6% (from 0.8165 to 0.6894) when trained on noisy ‘Majority’ data in the multi-class setting; and (3) this negative effect is compounded by task complexity, with the performance drop being nearly twice as severe for the multi-class task as for the binary task. A sensitivity analysis confirms that the drop is attributable to data quality (label noise), not sample size. We conclude that behavioral label ambiguity is a more significant bottleneck to model performance than model architecture, providing strong evidence for a data-centric approach.
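To make the two labeling strategies concrete, the sketch below shows one way the ‘Pure’ (unanimous agreement) and ‘Majority’ (majority vote) label sets could be derived from HateXplain’s three annotations per post. This is an illustrative assumption, not the authors’ pipeline: the function name `aggregate`, the variable names, and the tie-handling rule are hypothetical.

```python
# Minimal sketch (assumed, not the authors' code): deriving 'Pure' vs.
# 'Majority' label sets from per-post annotator votes. HateXplain posts
# carry three annotations each (hatespeech / offensive / normal).
from collections import Counter

def aggregate(annotator_labels):
    """Return (majority_label, is_unanimous) for one post's annotations.

    Returns (None, False) when no strict majority exists (e.g., a
    three-way tie), in which case the post is discarded.
    """
    counts = Counter(annotator_labels)
    label, votes = counts.most_common(1)[0]
    if votes <= len(annotator_labels) // 2:
        return None, False
    return label, votes == len(annotator_labels)

posts = [
    ["hatespeech", "hatespeech", "hatespeech"],  # unanimous: 'Pure' and 'Majority'
    ["hatespeech", "offensive", "hatespeech"],   # 2-of-3: 'Majority' only
    ["normal", "offensive", "hatespeech"],       # three-way tie: dropped
]

majority_set, pure_set = [], []
for votes in posts:
    label, unanimous = aggregate(votes)
    if label is None:
        continue
    majority_set.append(label)   # high-ambiguity condition
    if unanimous:
        pure_set.append(label)   # low-ambiguity condition
```

Under this reading, the ‘Pure’ condition is a strict subset of the ‘Majority’ condition, which is why the paper pairs the ambiguity comparison with a sensitivity analysis to rule out sample size as the cause of the performance drop.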