Human and LLM accent rating of English-L2 speech by Brazilian speakers

Felipe Flores Kupske
Laura Zorzi

Abstract

This study examines whether Large Language Models (LLMs), specifically the free (Flash) and paid (Pro) versions of Google Gemini, can approximate human judgments of accentedness in English-L2 speech produced by Brazilian speakers. Using telephone recordings from a subcorpus of the CSLU: 22 Languages Corpus, accent ratings from three native-speaker human judges were compared with AI-generated ratings on a four-point scale. Cumulative Link Mixed Models revealed systematic divergence across raters: the AI systems, particularly the Pro version, consistently overestimated accent strength, assigning severe ratings to samples that humans judged as moderate. These findings suggest that while human judgments integrate sociolinguistic and contextual cues, AI relies primarily on acoustic deviation and lacks the sociophonetic tolerance required for accent assessment.

Version published to 10.31234/osf.io/q9cj6_v1 on OSF Preprints
Dec 14, 2025

Related articles

Differentiating second language vowels based on diverse phonetic input

This article has 1 author:
1. Jonas Albæk Villumsen
This article has no evaluations
Latest version Jan 26, 2026

Language Experience Shapes Neural Grouping of Speech by Accent: EEG Evidence from Native, L2, and Heritage Listeners

This article has 3 authors:
1. Lauren Hong
2. Chao Han
3. Philip J. Monahan
This article has no evaluations
Latest version Dec 25, 2025

L3 acquisition of Mandarin Chinese stop consonants by learners with L1 Spanish/Thai and L2 English

This article has 3 authors:
1. Jingyi Han
2. Ting Wang
3. Miao Zhang
This article has no evaluations
Latest version Jan 14, 2026
