Community-acquired pneumonia identification from electronic health records in the absence of a gold standard: a Bayesian latent class analysis

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Community-acquired pneumonia (CAP) is common and a significant cause of mortality. However, CAP surveillance commonly relies on diagnostic codes from electronic health records (EHRs), whose accuracy is imperfect.

Methods

We used Bayesian latent class models to assess the accuracy of CAP diagnostic codes in the absence of a gold standard and to explore the contribution of various EHR data sources in improving CAP identification. Using records from 491,681 hospital admissions in Oxfordshire, UK, from 2016 to 2023, we investigated four EHR-based algorithms for CAP detection based on 1) primary diagnostic codes, 2) clinician-documented indications for antibiotic prescriptions, 3) radiology free-text reports, and 4) vital signs and blood tests.

Results

The estimated prevalence of CAP as the reason for emergency hospital admission was 13.2% (95% credible interval 12.8-13.6%). Primary diagnostic codes had low sensitivity but a high specificity (best fitting model, 0.283 and 0.997 respectively), as did vital signs with blood tests (0.242 and 0.988). Antibiotic indication text had a higher sensitivity (0.603) but a lower specificity (0.981), with radiology reports intermediate (0.493 and 0.959). Defining CAP as present when detected by any algorithm produced sensitivity and specificity of 0.854 and 0.925 respectively. Results remained consistent using alternative priors and in sensitivity analyses.

Conclusion

Relying solely on diagnostic codes for CAP surveillance leads to substantial under-detection; combining EHR data across multiple algorithms enhances identification accuracy. Bayesian latent class analysis-based approaches could improve CAP surveillance and epidemiological estimates by integrating multiple EHR sources, even without a gold standard for CAP diagnosis.

Article activity feed