A novel expectation-maximization approach to infer general diploid selection from time-series genetic data

Adam G. Fine
Matthias Steinrücken

Abstract

Detecting and quantifying the strength of selection is a major objective in population genetics. Since selection acts over multiple generations, many approaches have been developed to detect and quantify selection using genetic data sampled at multiple points in time. Such time-series genetic data is commonly analyzed using Hidden Markov Models, but in most cases under the assumption of additive selection. However, many examples of genetic variation exhibiting non-additive mechanisms exist, making it critical to develop methods that can characterize selection in more general scenarios. Here, we extend a previously introduced expectation-maximization algorithm for the inference of additive selection coefficients to the case of general diploid selection, in which the heterozygote and homozygote fitness are parameterized independently. We furthermore introduce a framework to identify bespoke modes of diploid selection from given data, a heuristic to account for variable population size, and a procedure for aggregating data across linked loci to increase power and robustness. Using extensive simulation studies, we find that our method accurately and efficiently estimates selection coefficients for different modes of diploid selection across a wide range of scenarios; however, power to classify the mode of selection is low unless selection is very strong. We apply our method to ancient DNA samples from Great Britain spanning the last 4,450 years and detect evidence for selection in six genomic regions, including the well-characterized LCT locus. Our work is the first genome-wide scan characterizing signals of general diploid selection.
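To make "general diploid selection" concrete, the following is a minimal sketch of the standard textbook Wright-Fisher model with independently parameterized heterozygote and homozygote fitness. It is not the authors' EM algorithm or HMM. The names `s1`, `s2`, `next_freq`, and `simulate` are illustrative assumptions, as is the fitness convention (aa = 1, Aa = 1 + s1, AA = 1 + s2).

```python
import random

def next_freq(p, s1, s2):
    """Expected frequency of allele A after one generation of selection.

    Assumed genotype fitnesses: aa = 1, Aa = 1 + s1, AA = 1 + s2.
    Special cases: additive (s1 = s2 / 2), dominant (s1 = s2),
    recessive (s1 = 0), overdominant (s1 > max(0, s2)).
    """
    q = 1.0 - p
    mean_fitness = 1.0 + 2.0 * s1 * p * q + s2 * p * p
    return p * (1.0 + s1 * q + s2 * p) / mean_fitness

def simulate(p0, s1, s2, n_diploid, generations, seed=0):
    """Simulate an allele-frequency trajectory with selection and drift.

    Each generation, 2 * n_diploid gene copies are drawn independently
    with success probability next_freq(p, s1, s2) (binomial sampling).
    """
    rng = random.Random(seed)
    copies = 2 * n_diploid
    p = p0
    trajectory = [p]
    for _ in range(generations):
        expected = next_freq(p, s1, s2)
        count = sum(1 for _ in range(copies) if rng.random() < expected)
        p = count / copies
        trajectory.append(p)
    return trajectory

if __name__ == "__main__":
    # Hypothetical parameters: a recessive beneficial allele in 500 diploids.
    traj = simulate(p0=0.2, s1=0.0, s2=0.1, n_diploid=500, generations=100)
    print(traj[0], traj[-1])
```

In a time-series setting, sampled allele counts at a few time points would be treated as noisy observations of such a trajectory. Estimating `s1` and `s2` from those observations is the inference problem the abstract describes.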

Version published to 10.1371/journal.pgen.1011769 on Jul 22, 2025
Version published to 10.1101/2024.05.10.593575 on bioRxiv on May 14, 2024

Related articles

Derivation of prediction error variance for non-genotyped individuals in genomic selection
Authors: Vinícius Junqueira, Marcos Jun-Iti Yokoo, Fernando Flores
Latest version: Dec 17, 2025

Reframing Population Genetic Structure as a Quantum Optimization Problem
Author: Andrew Davinack
Latest version: Dec 24, 2025

Combining genomic prediction and multi-trait indices through stochastic simulations: do index type and deployment order affect genetic gain?
Authors: Roberto Fritsche-Neto, Lorena Gabriela Coelho Queiroz, Jesimiel Viana, Kajal Gupta, Kashish Grover, Júlio César DoVale
Latest version: Dec 17, 2025