A Standard Framework for Converting Coronary Angiography Reports into Machine-Readable Format Using Large Language Models

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background and Objectives

Coronary angiography (CAG) reports contain many details about coronary anatomy, lesion characteristics, and interventional procedures. However, their free-text format limits their research utility. Therefore, we sought to develop and validate a framework leveraging large language models (LLMs) to convert CAG reports automatically into a standardized structured format.

Methods

Using 50 CAG reports from a tertiary hospital, we developed a multi-step framework to standardize and extract key information from CAG reports. First, a standard annotation schema was developed by cardiologists. Thereafter, an LLM (GPT-4o) converted the free-text CAG reports into the hierarchical annotation schema in a standardized format. Finally, clinically relevant information was extracted from the standardized schema. One hundred CAG reports from each of two hospitals were used for internal and external test, respectively. The 12 key information points included four CAG-related (previous stent information, lesion characteristics, and anatomical diagnosis) and eight percutaneous coronary intervention (PCI)-related key points (complex PCI criteria and current stent information). For internal test, two interventional cardiologists independently extracted information, with discrepancies resolved through consensus, as reference standard.

Results

Based on the reference standard, the proposed framework demonstrated superior accuracy for CAG-related (99.5% vs. 91.8%; p < 0.001) and comparable accuracy for PCI-related key points (98.3% vs. 97.4%; p = 0.512) in the internal test. External test confirmed high accuracy for both CAG-(96.2%) and PCI-related key points (99.4%).

Conclusions

This framework demonstrated excellent accuracy in standardizing free-text CAG reports, potentially enabling more efficient utilization of detailed clinical data for cardiovascular research.

Author’s Summary

The novel framework that standardizes CAG report is a practical solution to a significant challenge in cardiovascular research — the efficient utilization of detailed procedural and anatomical information untapped in free-text CAG reports. Our framework could enable systematic analysis of large-scale coronary intervention outcomes, reduce the burden of cardologists’ clinical trial recruitment, and support evidence-based clinical decision-making.

Article activity feed