Advancing Precision Health Discovery in a Genetically Diverse Health System
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Linking genetic data with electronic health records in hospital biobanks promises to advance precision medicine, but limited ancestral diversity constrains discovery and generalizability. We analyzed 93,936 participants from the UCLA ATLAS Community Health Initiative to inform disease prevalence and genetic risk across five continental and 36 fine-scale ancestry groups. We discovered numerous unreported gene-phenotype associations, including FN3K with intestinal disaccharidase deficiency in Europeans and admixed Americans. Polygenic scores (PGS) robustly predicted common diseases, with effects markedly diminished in non-Europeans. Furthermore, we reduced the pronounced European bias in curated clinical variants using computational predictors, uncovering unreported disease-gene associations, including ANKZF1 and peripheral vascular disease in AFR. Longitudinal data revealed that semaglutide efficacy varies across ancestries, is associated with PGS for type 2 diabetes, and is modulated by genetic variation in PTPRU . These findings illustrate how ancestrally diverse biobanks from a single health system yield robust disease associations and pharmacogenomic insights.