Automated speech content analysis to detect depression with large language models: towards multilingual and few-shot capabilities
Abstract
Large Language Models (LLMs) offer potential solutions for scalable depression detection across diverse populations. This study evaluates LLM-based speech content analysis for multilingual depression detection in clinical and general populations. We analyzed speech transcripts from three distinct cohorts: Chinese clinical (n = 52), Italian clinical (n = 116), and French general population (n = 1,347). Our LLM-based system, built on a state-of-the-art open-source LLM with few-shot prompting, was compared against traditional audio-embedding and text-embedding approaches for detecting depression and secondary symptoms (anxiety, insomnia, fatigue). The LLM system achieved depression-detection F1-scores of 0.96 (Chinese), 0.85 (Italian), and 0.40 (French), consistently outperforming the baseline methods. Depression sensitivity reached 1.00 (Chinese) and 0.93 (French), with high specificity in the clinical populations (0.93 Chinese, 0.88 Italian). Among secondary symptoms, anxiety detection performed well, with high sensitivity (0.85 Chinese, 0.97 French) and F1-scores of 0.78 (Chinese) and 0.31 (French), while performance varied for the other symptoms; fatigue detection was near random. Statistical analysis revealed language-dependent benefits from few-shot learning, with the Chinese cohort benefiting particularly from additional examples when larger models were used. Our findings demonstrate that LLM-based speech analysis provides robust multilingual capabilities for depression detection without requiring language-specific training data, offering a scalable solution for mental health screening across diverse populations.
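The abstract reports detection performance as sensitivity, specificity, and F1-score over binary depression labels. A minimal sketch of how these screening metrics are derived from predicted and reference labels is shown below; the labels are purely illustrative and not data from the study.

```python
# Sketch: computing sensitivity, specificity, and F1 for binary
# depression screening labels (1 = depressed, 0 = not depressed).
# The example labels below are made up for illustration only.

def screening_metrics(y_true, y_pred):
    """Return (sensitivity, specificity, F1) for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # true-positive rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # true-negative rate
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if (precision + sensitivity) else 0.0)
    return sensitivity, specificity, f1

# Illustrative usage (hypothetical labels):
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
sens, spec, f1 = screening_metrics(y_true, y_pred)  # each 0.75 here
```

In practice a library such as scikit-learn provides equivalent metrics; the hand-rolled version above only makes the definitions behind the reported numbers explicit.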