Improving performance of polygenic risk scores for hypertension across two ancestry groups

Marguerite R. Irvin
Vinodh Srinivasasainagendra
Nicole D. Armstrong
Amit Patki
Ulrich Broeckel
Zhe Wang
Leslie A. Lange
Nita A Limdi
Alicia Huerta-Chagoya
Joohyun Kim
Maggie C.Y. Ng
Josep M Mercader
Hemant K. Tiwari

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Polygenic risk score (PRS) methods are evolving, and the benefit of adding functional annotations to the variant weights has been especially promising. However, less attention has been given to how the linkage disequilibrium (LD) reference panel used affects the score performance. In the current study, we compared two Bayesian approaches, one that incorporates functional annotations (LDpred-funct) and one that does not (PRS-CS), extending these applications to the hypertension (HTN) trait across two ancestry groups (European Americans EA, and African Americans, AA). In PRS-CS we used the standard HapMap 3 LD (HM3) reference panel, as well as a modified multi-ancestry reference panel (TagIt) with better coverage of variants from multiple ancestries. Individual-level data in 1,533 EA (58% with HTN) and 8,603 AA (71% with HTN) participants from the Reasons for Geographic and Racial Differences in Stroke Study (REGARDS) was used to optimize scores across the two approaches. PRS performance metrics including R ² and odds ratios (OR) per standard deviation (SD) were then used to assess PRS performance in 1,270 EA (55% with HTN) and 1,896 AA (69% with HTN) participants from the Hypertension Genetic Epidemiology Network Study (HyperGEN). Among EAs in HyperGEN we observed an R ² of 6.0% for LD-Pred-funct and R ² of 7.3% for PRS-CS-TagIt versus R ² of 1.4% for PRS-CS-HM3. The magnitude of the OR per SD for HTN was also higher for PRS-CS-TagIt OR=2.17 (95% CI 1.65-2.85, p=3.0*10 ⁻⁸ ) and LD-Pred-funct OR=2.14 (95% CI 1.61-2.85, p=1.46*10 ⁻⁷ ) versus PRS-CS-HM3 (OR=1.40; 95% CI 10.8-1.82). Among AAs in HyperGEN, the improvements were more modest, where we observed R ² of 1.9% for LD-Pred-funct and R ² of 2.9% for PRS-CS-TagIt versus 0.7% for PRS-CS-HM3. We found that both annotations and the updated LD panel improved the scores in both ancestry groups, but did not make the scores more equitable across the groups.

Author Summary

Genomics can aid in risk prediction and prevention. Polygenic risk score (PRS) development and application is becoming more popular because there’s a lot of genetic data available for training PRS, and they have the potential to be useful in healthcare. However, PRS are not yet widely used in clinics, and the methods are still developing. Early PRS methods only used certain genetic markers, but newer ones use more data and better models to try to predict disease risk more accurately. Still, these tools often don’t work as well for people from underrepresented populations, which could increase health inequalities. Researchers are trying to fix this by using more up-to-date data and adding extra information about how genes function. In our study of HTN, we investigated newer approaches which made PRS more accurate for both for African American individuals and European American individuals—but they didn’t fully close the performance gap between the groups.

Version published to 10.1101/2025.11.05.25339527 on medRxiv
Nov 7, 2025

Within-family validation of a new polygenic predictor of general cognitive ability

This article has 6 authors:
1. Tobias Wolfram
2. Spencer Moore
3. Jeremiah H. Li
4. Jonathan Anomaly
5. Ivan Davidson
6. Michael Christensen
This article has no evaluationsLatest version Dec 11, 2025
Application of longitudinal follow-up data increases power in the identification of genetic loci for type 2 diabetes

This article has 1 author:
1. Seong Beom Cho
This article has no evaluationsLatest version Dec 18, 2025
Leveraging the shared and opposing genetic mechanisms in the heritable cardiomyopathies

This article has 68 authors:
1. Daria Kramarenko
2. Poeya Haydarlou
3. George Powell
4. Joel Rämö
5. Riyad Janan
6. Claire Prince
7. Dominic Zimmerman
8. Pantazis Theotokis
9. Prisca Thami
10. Jan Haas
11. Sophie Garnier
12. Frank Rühle
13. Edwin Poel
14. Amand Schmidt
15. Sharlene Day
16. Adam Helms
17. Rachel Lampert
18. Victoria Parikh
19. Jodie Ingles
20. Neal Lakdawala
21. Anjali Owens
22. Sara Saberi
23. John Stendhal
24. Euan Ashley
25. Belinda Gray
26. Mark Russell
27. Thomas Ryan
28. Joseph Rossano
29. Dominic Abrams
30. Iacopo Olivotto
31. Erin Miller
32. Kimberly Lin
33. Niccolo Maurizi
34. Alessia Argiro
35. Colin Berry
36. Rob Cooper
37. Andrew Flett
38. Roy Gardner
39. John Greenwood
40. Brian Halliday
41. David Hutchings
42. Masliza Mahmod
43. Gerry McCann
44. Stephen Page
45. Charles Peebles
46. Betty Raman
47. Peter Swoboda
48. Amanda Varnava
49. David Wright
50. Sanjay Prasad
51. Stuart Cook
52. Upsala Tayal
53. Rachel Buchan
54. Roddy Walsh
55. Arthur Wilde
56. Benjamin Meder
57. Philippe Charron
58. Anuj Goel
59. Ahmad Amin
60. Patrick Ellinor
61. Krishna Aragam
62. Rafik Tadros
63. Yigal Pinto
64. Carolyn Ho
65. Hugh Watkins
66. James Ware
67. Connie Bezzina
68. Sean Jurgens
This article has no evaluationsLatest version Jan 27, 2026

Discuss this preprint

Listed in

Abstract

Author Summary

Article activity feed

Related articles

Within-family validation of a new polygenic predictor of general cognitive ability

Application of longitudinal follow-up data increases power in the identification of genetic loci for type 2 diabetes

Leveraging the shared and opposing genetic mechanisms in the heritable cardiomyopathies