Individualized Uncertainty Quantification in Polygenic Risk Scores Using Conformalized Quantile Regression

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Polygenic risk scores (PRS) are widely used in post-GWAS analyses to predict complex traits across humans, animals, and plants. While significant progress has been made in developing new PRS methods, much less attention has been given to quantifying the uncertainty associated with these predictions. In this work, we propose a method for individualized uncertainty quantification based on quantile regression. When paired with conformal prediction, this approach enables the construction of prediction intervals with guaranteed coverage, offering lower and upper bounds within which the phenotype is likely to fall with high probability. We apply this framework to data from the UK Biobank and the ProgeNIA/SardiNIA studies, showing that the resulting prediction intervals: (1) maintain valid coverage under minimal model assumptions, (2) provide more realistic individualized estimates of uncertainty by allowing for asymmetry and individual-specific interval lengths, and (3) exhibit reduced uncertainty compared to existing methods. Overall, we present a novel framework for individualized uncertainty quantification in PRS analyses and highlight the importance of incorporating uncertainty into predictive modeling.

Article activity feed