Design and Implementation of an End-to-End AI-Driven Colonoscopy Recall Workflow at Scale

Aman Mohapatra
Rachel Porth
Si Wong
Heather Hardy
Gail Piatkowski
John Shang
Maelys Amat
Sarah Flier
Adam Salsman
Ted Fitzgerald
Ayad Shammout
David Rubins
Amy Miller
Venkat Jegadeesan
Arvind Ravi
Joseph Feuerstein

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The rise of structured data elements in Electronic Health Records (EHRs) is a key enabler of improving care quality. However, the transition towards routine use of these fields paradoxically heightens patient safety risks due to increased variability in documentation and the use of placeholder values pending manual review. For large clinical initiatives such as colon cancer screening and surveillance, misinterpretation of recorded clinical data can be particularly problematic, disrupting risk-adapted recall guidance and potentially exacerbating care gaps. This case study details the development and deployment of a Large Language Model (LLM)-driven workflow to extract and transfer unstructured colonoscopy recall recommendations as part of a larger EHR migration. Utilizing GPT-4 Turbo for the core inference step of a fully integrated pipeline —spanning custom SQL queries, Optical Character Recognition (OCR) of historical PDFs, LLM-based inference, and anomaly detection — we successfully structured and migrated population-wide colonoscopy recall data corresponding to over 100,00 patients and 10 years of clinical care. The pipeline demonstrated high accuracy (Macro F1=1.0 against clinician review), scalability, and cost efficiency. We estimate that use of this workflow — relative to the alternative of a default 10-year reminder from last colonoscopy —may prevent over 6,000 new colorectal cancer cases (a projected cost savings of $400-670 million). Key lessons from implementation include the importance of stakeholder alignment, the necessity of robust quality control at scale, and the technical challenges of expanding optimized LLM inference to a fully-fledged end-to-end clinical workflow.

Version published to 10.1101/2025.07.11.25331400 on medRxiv
Jul 14, 2025

ClinicalStatAI: A Cloud-Based, AI-Augmented Platform for Accessible Survival Analysis in Healthcare

This article has 1 author:
1. Fadhaa Ali
This article has no evaluationsLatest version Jun 18, 2025
Implementation of Large Language Models in Electronic Health Records

This article has 3 authors:
1. Maxime Griot
2. Jean Vanderdonckt
3. Demet Yuksel
This article has no evaluationsLatest version Jul 4, 2025
Enhancing Privacy-Preserving Deployable Large Language Models for Perioperative Complication Detection: A Targeted Strategy with LoRA Fine-tuning

This article has 10 authors:
1. Shaowei Gao
2. Xu Zhao
3. Lihui Chen
4. Junrong Yu
5. shuning Tian
6. Huaqiang Zhou
7. jingru Chen
8. Sizhe Long
9. Qiulan He
10. Xia Feng
This article has no evaluationsLatest version Jun 13, 2025

Listed in

Abstract

Article activity feed

Related articles

ClinicalStatAI: A Cloud-Based, AI-Augmented Platform for Accessible Survival Analysis in Healthcare

Implementation of Large Language Models in Electronic Health Records

Enhancing Privacy-Preserving Deployable Large Language Models for Perioperative Complication Detection: A Targeted Strategy with LoRA Fine-tuning