Computable Phenotypes for Respiratory Viral Infections in the All of Us Research Program
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Electronic health records (EHRs) contain rich temporal data about infectious diseases, but an optimal approach to identify infections remains undefined. Using the All of Us Research Program, we developed computable phenotypes for respiratory viruses by integrating billing codes, prescriptions, and laboratory results within 90-day episodes. Phenotypes computed from 265,222 participants yielded cohorts ranging from 238 (adenovirus) to 28,729 (SARS-CoV-2) cases. Virus-specific billing codes showed varied sensitivity (8-67%) and high positive predictive value (90-97%), except for influenza virus and SARS-CoV-2 where lower PPV (69-70%) improved with increasing billing codes. Identified infections exhibited expected seasonal patterns and virus proportions when compared with CDC data. This integrated approach identified episodic disease more effectively than individual components alone and demonstrated utility in identifying severe infections. The method enables large-scale studies of host genetics, health disparities, and clinical outcomes across episodic diseases.