Polygenic prediction of phenotypes with a neural empirical Bayes approach

Joshua Weinstock
April Kim
Alexis Battle

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Polygenic risk scores (PRS) estimate the expected value of a phenotype based on individual genotypes. Although statistical approaches for calculating PRS have advanced considerably in recent years, few methods incorporate recently generated functional genomics atlases to improve SNP weight estimation. Here, we introduce PRS with a Functional Neural Network (PRSFNN) - a novel approach which uses a neural network in an empirical Bayesian framework to learn the links between SNP functional annotations and SNP weights. By learning these links with a neural network, PRSFNN is able to learn complex, non-linear functions of annotations with minimal assumptions. After curating extensive annotations, including ancestry-stratified allele frequencies, chromatin accessibility across hundreds of developmental and adult cell types, transcription factor binding from ENCODE4, quantitative trait loci, and sequence conservation from Zoonomia, we evaluated PRSFNN on 18 continuous complex traits in the UK Biobank. After benchmarking against other leading PRS methods in an out-of-sample test set, we find that PRSFNN outperforms other PRS methods on 17 of 18 traits. Finally, we show that a low-density lipoprotein PRS estimated with PRSFNN outperforms other PRS methods in the prediction of incident cardio-vascular disease. Overall, PRSFNN uses a curated SNP annotation atlas within a neural empirical Bayesian framework to achieve state-of-the-art performance, advancing our ability to predict phenotypic variation from genetic variation.

Version published to 10.1101/2025.07.21.25331918 on medRxiv
Jul 22, 2025

Bayesian fine-mapping pinpoints candidate genes and pleiotropic loci of production traits from a chicken backcrossing scheme

This article has 8 authors:
1. Chi Mei Sun
2. Johannes Geibel
3. Henner Simianer
4. Björn Andersson
5. David Cavero
6. Rudolf Preisinger
7. Steffen Weigend
8. Christian Reimer
This article has no evaluationsLatest version Jan 13, 2026
Impact of scale parameter for marker variance prior in some Bayesian whole-genome regression methods

This article has 2 authors:
1. Özge KOZAKLI
2. Ayhan CEYHAN
This article has no evaluationsLatest version Jan 20, 2026
Decoding Complex Genotype-Phenotype Interactions by Discretizing the Genome

This article has 6 authors:
1. Jędrzej Kubica
2. Hetvi Jethwani
3. Krzysztof H. Banecki
4. Mauricio Moldes
5. Dariusz Plewczynski
6. Ben Busby
This article has no evaluationsLatest version Dec 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Bayesian fine-mapping pinpoints candidate genes and pleiotropic loci of production traits from a chicken backcrossing scheme

Impact of scale parameter for marker variance prior in some Bayesian whole-genome regression methods

Decoding Complex Genotype-Phenotype Interactions by Discretizing the Genome