Automated Detection of Referential Features in Schizophrenic Speech Using Large Language Models

Derya Cokal
Melike Filizer
Martin Villalba
Douglas Turkington
Nicol Ferrier
Klaus von Heusinger
Stuart Watson
Wolfram Hinzen
Massimo Poesio


Abstract

Cross-linguistic studies have demonstrated that individuals with schizophrenia, particularly those exhibiting formal thought disorder (FTD), show distinctive distributions of noun phrases (NPs) in spontaneous speech. NPs (e.g., the picture; a husband) serve to organize the referential structure of meaning. Extracting such referential NP features, however, has traditionally required manual annotation. In this study we applied state-of-the-art large language models (LLMs) to extract these features automatically, using an existing, manually annotated dataset in which English-speaking participants described a comic strip: 30 individuals with schizophrenia (SZ), of whom 15 had moderate or severe FTD (SZ+FTD) and 15 had minimal or no FTD (SZ−FTD), and 15 neurotypical controls (NC). First, we show that LLM-based analyses replicate the findings based on manual annotation, in particular that definite NPs tied to prior discourse, which are markers of grammatical and cognitive complexity and of narrative coherence, were significantly underused in the SZ+FTD group. Second, we demonstrate that LLMs, especially when used with in-context (few-shot) learning, offer a promising avenue for the automatic extraction of referential features. These results show that a cross-linguistically validated and clinically important linguistic pattern of deviance is accessible to automated assessment with NLP.
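
To make the idea of in-context (few-shot) learning concrete, the sketch below assembles a prompt from a handful of labeled noun phrases followed by an unlabeled query. It is an illustration only and not the authors' pipeline: the two-way label set, the `Example` class, and the `build_few_shot_prompt` helper are hypothetical names introduced here, and the study's actual annotation scheme, prompts, and models are not described on this page. The two demonstration NPs are the ones the abstract itself gives as examples.

```python
from dataclasses import dataclass

# Hypothetical label set for illustration; the study's scheme may differ.
LABELS = ("definite", "indefinite")


@dataclass(frozen=True)
class Example:
    sentence: str
    noun_phrase: str
    label: str


def build_few_shot_prompt(examples, sentence, noun_phrase):
    """Return a prompt: instruction, labeled demonstrations, then the query."""
    for ex in examples:
        if ex.label not in LABELS:
            raise ValueError(f"unknown label: {ex.label}")
    lines = [f"Classify the noun phrase as one of: {', '.join(LABELS)}.", ""]
    for ex in examples:
        lines += [
            f"Sentence: {ex.sentence}",
            f"Noun phrase: {ex.noun_phrase}",
            f"Label: {ex.label}",
            "",
        ]
    # The query is left unlabeled so the model completes the final "Label:".
    lines += [f"Sentence: {sentence}", f"Noun phrase: {noun_phrase}", "Label:"]
    return "\n".join(lines)


# Demonstrations built around the NPs mentioned in the abstract.
DEMOS = [
    Example("She looked at the picture.", "the picture", "definite"),
    Example("She has a husband.", "a husband", "indefinite"),
]

if __name__ == "__main__":
    print(build_few_shot_prompt(DEMOS, "The man opens the door.", "the door"))
```

The resulting string would be sent to whichever LLM is in use; with an empty demonstration list the same helper yields a zero-shot prompt, which is one simple way to compare the two settings.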

Version published to 10.31234/osf.io/q4sxk_v1 on OSF Preprints
Jul 28, 2025

Related articles

Automated Speech-Fluency Explanations for Schizophrenia Diagnosis

This article has 4 authors:
1. Rok Rajher
2. Mila Marinković
3. Polona Rus Prelog
4. Jure Žabkar
This article has no evaluations. Latest version: Sep 9, 2025

Three Dimensions of Speech Coherence in People with Early Psychosis and Their Family Members

This article has 8 authors:
1. Derya Cokal
2. Abdulrahman Aloraini
3. Claudio Flores Palominos
4. Cemal Demirlek
5. Burcu Verim
6. Berna Yalınçetin
7. Emre Bora
8. Wolfram Hinzen
This article has no evaluations. Latest version: Jul 28, 2025

What is the retest reliability of computationally extractable speech and language markers?

This article has 9 authors:
1. Derya Cokal
2. Martin Villalba
3. Rui He
4. Claudio Flores Palominos
5. Annkathrin Böke
6. Philipp Homan
7. Klaus von Heusinger
8. Joseph Kambeitz
9. Wolfram Hinzen
This article has no evaluations. Latest version: Jul 28, 2025
