From Keywords to Context: Bridging Expert Insight and Language Models for Multidimensional Sleep Health Classification in Clinical Notes

Syed-Amad Hussain
Ariana Calloway
Joseph W Sirrianni
Eric Fosler-Lussier
Mattina Davenport

Abstract

Accurate detection of multidimensional sleep health (MSH) information from electronic health records (EHRs) is critical for improving clinical decision-making but remains challenging due to sparse documentation and class imbalance. This study investigates whether integrating expert-guided annotations and keyword-based heuristics with large language models (LLMs) enhances the extraction of nuanced MSH indicators from clinical narratives. Using a novel, expertly annotated dataset (NCH-Sleep), we trained and evaluated models to classify clinical notes across nine clinically relevant MSH categories. Our baseline model demonstrated substantial predictive capability using raw text alone. Incorporating manually annotated spans (oracle annotations) dramatically improved performance, highlighting the benefit of targeted expert guidance. Additionally, employing curated keyword annotations within varying context windows significantly enhanced model interpretability while retaining strong predictive accuracy. Through detailed bias analyses, we identified consistent performance across demographics and clinical settings, although specific disparities underscored the importance of balanced expert oversight. Our findings emphasize the value of expert-informed supervision and heuristic approaches in building scalable, interpretable clinical NLP systems for sleep health classification.
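As a rough illustration of what "keyword annotations within varying context windows" could look like in practice, the following minimal Python sketch extracts a fixed number of tokens on either side of each keyword hit in a note. It is not the authors' implementation: the function name `keyword_windows`, the tokenization rule, the single-token matching, and the example keywords are all assumptions made for illustration, and the actual NCH-Sleep keyword list is not given in the abstract.

```python
import re


def keyword_windows(note, keywords, window=5):
    """Return text snippets of +/- `window` tokens around each keyword hit.

    Hypothetical helper: matches single-token keywords case-insensitively.
    """
    # Split into word tokens and standalone punctuation marks.
    tokens = re.findall(r"\w+|[^\w\s]", note)
    lowered = [t.lower() for t in tokens]
    vocab = {k.lower() for k in keywords}

    snippets = []
    for i, tok in enumerate(lowered):
        if tok in vocab:
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            snippets.append(" ".join(tokens[start:end]))
    return snippets


# Illustrative note and keywords (invented, not drawn from the dataset).
example_note = (
    "Patient reports snoring nightly and daytime sleepiness "
    "despite regular bedtime."
)
example_snippets = keyword_windows(
    example_note, ["snoring", "sleepiness"], window=2
)
```

Varying `window` changes how much surrounding context a downstream classifier would see for each keyword hit, which is the kind of trade-off between interpretability and predictive accuracy that the abstract describes.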

Version published to 10.1101/2025.06.06.25329167v1 on medRxiv
Jun 7, 2025

Related articles

Empirical Review of LLM-driven Classification of Multidimensional Sleep Health Mentions from Free-Text Clinical Notes

This article has 5 authors:
1. Syed-Amad Hussain
2. Ariana Calloway
3. Joseph Sirrianni
4. Eric Fosler-Lussier
5. Mattina Davenport
This article has no evaluations
Latest version Jun 5, 2025

Automated Insomnia Phenotyping from Electronic Health Records: Leveraging Large Language Models to Decode Clinical Narratives

This article has 11 authors:
1. Guillermo Lopez-Garcia
2. Davy Weissenbacher
3. Matthew Stadler
4. Karen O’Connor
5. Dongfang Xu
6. Lauren Gryboski
7. Jared Heavens
8. Noor Abu-el-Rub
9. Diego R. Mazzotti
10. Subhajit Chakravorty
11. Graciela Gonzalez-Hernandez
This article has no evaluations
Latest version Jun 3, 2025

Implementation of Large Language Models in Electronic Health Records

This article has 3 authors:
1. Maxime Griot
2. Jean Vanderdonckt
3. Demet Yuksel
This article has no evaluations
Latest version Jul 4, 2025
