Enhancing Substance Use Detection in Clinical Notes with Large Language Models


Abstract

Identifying substance use behaviors in electronic health records (EHRs) is challenging because critical details are often buried in unstructured notes that use varied terminology and negation, requiring careful contextual interpretation to distinguish current use from historical mentions or denials. Using MIMIC-III/IV discharge summaries, we created a large annotated drug detection dataset to tackle this problem and support future systematic substance use surveillance. We then investigated the performance of multiple large language models (LLMs) for detecting eight substance use categories within these data. Evaluating models in zero-shot, few-shot, and fine-tuned configurations, we found that a fine-tuned model, Llama-DrugDetector-70B, outperformed the others. It achieved near-perfect F1-scores (≥ 0.95) for most individual substances and strong scores on more complex tasks such as prescription opioid misuse (F1 = 0.815) and polysubstance use (F1 = 0.917). These findings demonstrate that LLMs can significantly enhance detection and show promise for clinical decision support and research, although further work on scalability is warranted.
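To make the task concrete, below is a minimal zero-shot sketch of the kind of per-category detection the abstract describes. It assumes an instruction-tuned Llama model served through the Hugging Face transformers library; the prompt wording, the model checkpoint, and the substance label set are illustrative assumptions, not the authors' published protocol or their fine-tuned Llama-DrugDetector-70B.

```python
# Hypothetical zero-shot sketch: prompt, labels, and model are assumptions,
# not the paper's actual pipeline.
from transformers import pipeline

# Stand-in for the paper's eight substance use categories (exact set not
# reproduced here).
SUBSTANCES = [
    "alcohol", "cannabis", "cocaine", "opioids",
    "benzodiazepines", "stimulants", "hallucinogens", "polysubstance",
]

PROMPT_TEMPLATE = (
    "You are reviewing a hospital discharge summary. For the substance "
    "'{substance}', answer 'yes' only if the note documents current or "
    "recent use by the patient; answer 'no' for denials, historical "
    "mentions, or use by someone other than the patient.\n\n"
    "Note:\n{note}\n\nAnswer (yes/no):"
)

# Assumed checkpoint; the paper's strongest model is a fine-tuned 70B variant.
generator = pipeline("text-generation", model="meta-llama/Llama-3.1-8B-Instruct")

def detect_substances(note: str) -> dict[str, bool]:
    """Return a yes/no flag for each substance category in one note."""
    results = {}
    for substance in SUBSTANCES:
        prompt = PROMPT_TEMPLATE.format(substance=substance, note=note)
        completion = generator(
            prompt, max_new_tokens=3, return_full_text=False
        )[0]["generated_text"]
        results[substance] = completion.strip().lower().startswith("yes")
    return results

note = "Patient denies alcohol use. Admits to smoking cannabis daily."
print(detect_substances(note))  # e.g. {'alcohol': False, 'cannabis': True, ...}
```

Framing each category as a separate yes/no question keeps negation handling explicit (the "Patient denies alcohol use" sentence should yield 'no' for alcohol); a single multi-label prompt is an equally plausible design, and the paper's few-shot and fine-tuned configurations would build on the same task framing.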
