Automated Speech-Fluency Explanations for Schizophrenia Diagnosis

Rok Rajher
Mila Marinković
Polona Rus Prelog
Jure Žabkar

Abstract

Schizophrenia is a chronic and severe mental disorder whose diagnosis still relies on time-intensive, clinician-administered assessments. Although several automated approaches have been proposed to support diagnosis, these systems often lack the level of explainability necessary for informed clinical decision-making. In this study, we present a fully automated and explainable pipeline for detecting schizophrenia from audio recordings of verbal fluency tests, collected from 126 Slovene-speaking participants (68 healthy controls, 58 individuals diagnosed with schizophrenia). The pipeline builds on recent advances in automatic speech recognition (ASR) and large language model (LLM) systems. We evaluated three ASR models (Truebar, Whisper, and Soniox) for transcription quality and selected the best-performing system for further processing. We semantically enriched the transcriptions using the generative capabilities of LLMs and extracted both verbal and non-verbal features grounded in established diagnostic criteria. We assessed the relevance of these features using a Bayesian statistical framework and trained multiple classical machine learning models for automatic classification. Our best-performing model, an Explainable Boosting Machine, achieved a classification accuracy of 0.82 and an AUC of 0.90. We further generated visual explanations for the model's predictions, establishing the first fully automated and explainable schizophrenia detection framework developed for the Slovene language. Our approach prioritizes explainability through model-transparent outputs while still achieving performance comparable to existing automated systems for speech-based schizophrenia detection.
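
For readers less familiar with the two metrics reported in the abstract, the following is a minimal sketch of how classification accuracy and AUC can be computed from a model's predicted probabilities. It is not the authors' evaluation code: the function names, the 0.5 decision threshold, and the toy labels and scores are assumptions made purely for illustration and are unrelated to the study data.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_score: Sequence[float],
             threshold: float = 0.5) -> float:
    """Fraction of cases whose thresholded score matches the true label.

    The 0.5 threshold is an assumption; the abstract does not state one.
    """
    predictions = [1 if score >= threshold else 0 for score in y_score]
    correct = sum(pred == label for pred, label in zip(predictions, y_true))
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive case receives
    a higher score than a randomly chosen negative case (ties count 0.5).
    """
    positives = [s for s, label in zip(y_score, y_true) if label == 1]
    negatives = [s for s, label in zip(y_score, y_true) if label == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one case from each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data: 1 = patient, 0 = control (not from the study).
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.2, 0.1]

print(f"accuracy = {accuracy(toy_labels, toy_scores):.2f}")
print(f"AUC      = {roc_auc(toy_labels, toy_scores):.2f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason the two numbers are often reported together.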

Version published to 10.21203/rs.3.rs-7440282/v1 on Research Square
Sep 9, 2025

Related articles

Generating Alzheimer's Narratives Using Large Language Models

This article has 5 authors:
1. Paula Andrea Perez-Toro
2. Mahmoud Almizel
3. Elmar Nöth
4. Andreas Maier
5. Tomas Arias-Vergara
This article has no evaluations. Latest version: Oct 13, 2025

Normative clinical language data and task specific effects

This article has 2 authors:
1. Brielle C Stark
2. Charalambos Themistocleous
This article has no evaluations. Latest version: Sep 22, 2025

Psychiatric Voice Biomarkers: Methodological flaws in pediatric populations

This article has 9 authors:
1. Hammza Jabbar Abd Sattar Hamoudi
2. Mon-Ju Wu
3. Marsal Sanches
4. Cesar A. Soutullo
5. Carolina Olmos
6. Leslie K. Taylor
7. Giovanna Zunta-Soares
8. Jair C. Soares
9. Benson Mwangi
This article has no evaluations. Latest version: Oct 15, 2025
