Evaluating Large Language Models for Translating Multimodal Phenotype Documentations into Executable EHR Phenotyping Algorithms

Chao Yan
Yi Xin
Wu-Chen Su
Srushti Gangireddy
Shravani Durbhakula
Stephen P. Bruehl
Alyson L. Dickson
Lang Li
QiPing Feng
Bradley A. Malin
Tyler Derr
Wei-Qi Wei

Abstract

Research applications of electronic health record (EHR) phenotypes require translating clinical definitions into executable EHR database queries, a labor-intensive process. We evaluated two frontier large language models across five phenotypes and three documentation modalities. Both models captured high-level logic from structured text but degraded markedly with diagram-only input. Error analysis revealed seven failure categories. Documentation, rather than model capability, was the primary bottleneck, reinforcing the need for standardization and expert oversight.

Version published to 10.64898/2026.05.20.26353690 on medRxiv
May 22, 2026

Related articles

Generation and Evaluation of Realistic Synthetic Clinical Progress Notes for Prostate Cancer using Large Language Models

This article has 6 authors:
1. Álvaro Rey-Blanes
2. Francisco J. Moreno-Barea
3. Javier Veredas-Morente
4. Eloy Vivas-Vargas
5. Fátima Gil-García
6. Francisco J. Veredas
This article has no evaluations. Latest version: May 28, 2026

General-purpose large language models can achieve physician-level accuracy in complex medical data extraction

This article has 2 authors:
1. Manu Rajeev
2. Ananthu Narayan
This article has no evaluations. Latest version: Jun 10, 2026

Augmenting Structured Diagnoses through Effective Use of Pre-trained Large Language Models on Clinical Notes

This article has 6 authors:
1. Hanieh Razzaghi
2. Nhat Nguyen
3. Mohan Pargi
4. Kaleigh Wieand
5. H. Timothy Bunnell
6. L. Charles Bailey
This article has no evaluations. Latest version: Jun 2, 2026
