Wearable and Interview-based Assessment of Psychological Risk in Alzheimer’s Caregivers: Machine Learning vs. Large Language Models

Junen Xiao
Zihan Zhao
Zachary D. King
Maryam Khalid
Sara Davies
Khadija Zanna
Daniel L. Argueta
Kelly N. Brice
E. Lydia Wu-Chung
Vincent D. Lai
Jensine Paoletti-Hatcher
Bryan T. Denny
Samantha Henry
Paul E. Schulz
Christopher P. Fagundes
Akane Sano

Abstract

Spousal caregivers of individuals with Alzheimer’s disease and related dementias frequently experience elevated perceived stress, caregiver burden, and loneliness, which are associated with adverse health outcomes. Early identification is therefore critical for timely intervention. Existing approaches commonly rely on wearable sensor data and standardized psychological questionnaires, while recent multimodal methods aim to improve prediction by integrating behavioral and linguistic information.

In this study, we explored three modality configurations (wearable-derived features, interview-based text, and their combination) to classify caregiver psychological risk as measured by the Perceived Stress Scale (PSS), Zarit Burden Interview, and UCLA Loneliness Scale. We compared traditional machine learning models and large language models (LLMs) (Gemini 2.0, Llama 4, and GPT-4o) under psychometrician-centered and caregiver-centered prompting strategies.
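
As a rough illustration of the three modality configurations described above, the following Python sketch builds a wearable-only, an interview-only, and a combined feature vector for one participant, and turns a questionnaire total into a binary risk label. The feature names, their values, the concatenation-based fusion, and the `cutoff` parameter are all assumptions made for illustration; the abstract does not specify the actual features, fusion method, or thresholds.

```python
from typing import Dict, List

# Hypothetical feature names and values, for illustration only; the source
# does not list the actual wearable or interview features.
wearable = {"sleep_hours": 6.1, "step_count": 4200.0}
interview = {"negative_word_rate": 0.08, "first_person_rate": 0.12}


def build_configurations(wearable: Dict[str, float],
                         interview: Dict[str, float]) -> Dict[str, List[float]]:
    """Return one feature vector per modality configuration."""
    # Sort keys so the feature order is deterministic across participants.
    w = [wearable[k] for k in sorted(wearable)]
    t = [interview[k] for k in sorted(interview)]
    return {
        "wearable_only": w,
        "interview_only": t,
        # Plain concatenation (early fusion) is an assumption, not the
        # authors' documented method.
        "multimodal": w + t,
    }


def binarize(score: float, cutoff: float) -> int:
    """Map a questionnaire total to a risk label: 1 if score >= cutoff, else 0.

    The cutoff is supplied by the caller; no scale-specific threshold is
    implied here.
    """
    return int(score >= cutoff)


configs = build_configurations(wearable, interview)
```

Each entry of `configs` could then be passed to the same classifier, so that any difference in performance reflects the input modality rather than the model.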

Traditional machine learning models performed better under multimodal settings, while LLMs achieved stronger performance with interview-only input. We further show that PSS was the most predictable construct and that prompting strategies substantially influenced LLM performance.

Author summary

People caring for spouses with Alzheimer’s disease and related dementias often experience high levels of stress, caregiver burden, and loneliness, all of which are linked to adverse psychological and physical health outcomes. Early identification of caregivers at heightened psychological risk is essential for timely support. We evaluated three data modalities (wearable-derived features, interview-based text, and their combination) to classify caregiver risk as measured by the Perceived Stress Scale (PSS), Zarit Burden Interview, and UCLA Loneliness Scale. Traditional machine learning models and large language models (LLMs) (Gemini 2.0, Llama 4, GPT-4o) were compared under multiple prompting strategies.

Our findings showed that traditional machine learning approaches performed best when combining wearable-derived behavioral features with interview-derived linguistic features, while LLMs were more effective for analyzing interview-based text. PSS was the most predictable construct, while caregiver burden and loneliness were more difficult to detect. Prompting choices significantly influenced LLM performance, and Gemini 2.0 showed the most stable overall results. These findings highlight the importance of aligning model choice with data modality when developing digital health tools for caregiver risk identification.

Version published to 10.64898/2026.05.24.26353993 on medRxiv
May 27, 2026

Related articles

Wearable sensing for quantifying cognitive and balance functions in naturalistic movements of older adults with mild cognitive impairment in therapeutic environments

This article has 9 authors:
1. Jangwon Lim
2. Rahul Islam
3. Dharini Raghavan
4. Bolaji Omofojoye
5. Amy D. Rodriguez
6. Yashar Kiarashi
7. Rachel Hershenberg
8. Gari D. Clifford
9. Hyeokhyen Kwon
This article has no evaluations. Latest version: Jun 22, 2026.

Predicting Depression and Anxiety Progression in Multiple Sclerosis from Longitudinal Clinical Data Using Machine Learning

This article has 6 authors:
1. Bernhard Specht
2. Samaher Garbaya
3. Reinhard Schneider
4. Ricardo Chavarriaga
5. Djamel Khadraoui
6. Zied Tayeb
This article has no evaluations. Latest version: Jun 25, 2026.

Accelerometry-Derived REM Sleep Behavior Disorder Predicts Future Parkinson’s Disease in the UK Biobank

This article has 10 authors:
1. Giorgio Ricciardiello Mejia
2. Andreas Brink-Kjaer
3. Lang Liu
4. Li Zhou
5. Katarina Gunter
6. Kang Hyun Ryu
7. Sajila Wickramaratne
8. Ankit Parekh
9. Ziv Gan-Or
10. Emmanuel During
This article has no evaluations. Latest version: Jul 6, 2026.
