Delphi: Deep Learning for Polygenic Risk Prediction

Costa Georgantas
Zoltan Kutalik
Jonas Richiardi

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Polygenic scores (PGS) are relative measures of an individual's genetic propensity to a particular trait or disease. Most PGS methods use a regression framework for polygenic modeling and assume that mutation effect estimates are constant across individuals. While these assumptions simplify computation, they increase error, and PGS are particularly less predictive for under-represented genetic ancestries. We developed and provide Delphi (deep learning for phenotype inference), an individual-level deep-learning method that relaxes these assumptions to produce more predictive PGS. Delphi can integrate up to hundreds of thousands of SNPs as input and model non-linear SNP-SNP and SNP-covariate interactions. We compare our results with linear PGS models and a gradient-boosted trees-based method. We show that deep learning can be an effective approach to genetic risk prediction. We report substantial performance gains for a broad range of continuous phenotypes compared to the state-of-the-art. Furthermore, we show that Delphi tends to increase the weight of high-effect mutations. This work demonstrates an effective deep learning method for modeling genetic risk that also generalizes well when evaluated on individuals from non-European ancestries.

Version published to 10.1101/2024.04.19.24306079 on medRxiv
Apr 20, 2024

Triangulated causal inference with deep counterfactual learningfor individualized statin-associated type 2 diabetes risk

This article has 13 authors:
1. Hao Zhou
2. Jorge Passamani Zubelli
3. Haralampos Hatzikirou
4. Andreas Henschel
5. Laurent Alain Najman
6. Daniel E. Platt
7. Antonello Maruotti
8. Siobhan O’Sullivan
9. Lithe Basbous
10. Cynthia Al Hageh
11. Mariam AlHarbi
12. Antoine Abchee
13. Pierre Zalloua
This article has no evaluationsLatest version Jan 27, 2026
Multi-View Autoencoder Framework with Feature Recalibration and Ensemble Learning for Predicting Heart Disease

This article has 2 authors:
1. Abulfadhel Amer Saihood Altufaili
2. Dunya Mohammed Shleej
This article has no evaluationsLatest version Dec 11, 2025
Within-family validation of a new polygenic predictor of general cognitive ability

This article has 6 authors:
1. Tobias Wolfram
2. Spencer Moore
3. Jeremiah H. Li
4. Jonathan Anomaly
5. Ivan Davidson
6. Michael Christensen
This article has no evaluationsLatest version Dec 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Triangulated causal inference with deep counterfactual learningfor individualized statin-associated type 2 diabetes risk

Multi-View Autoencoder Framework with Feature Recalibration and Ensemble Learning for Predicting Heart Disease

Within-family validation of a new polygenic predictor of general cognitive ability