Limits and convergence properties of the sequentially Markovian coalescent

Thibaut Paul Patrick Sellinger
Diala Abu‐Awad
Aurélien Tellier

Listed in

Evaluated articles (Peer Community in Evolutionary Biology)

Abstract

Several methods based on the sequentially Markovian coalescent (SMC) make use of full genome sequence data from samples to infer population demographic history including past changes in population size, admixture, migration events and population structure. More recently, the original theoretical framework has been extended to allow the simultaneous estimation of population size changes along with other life history traits such as selfing or seed banking. The latter developments enhance the applicability of SMC methods to nonmodel species. Although convergence proofs have been given using simulated data in a few specific cases, an in‐depth investigation of the limitations of SMC methods is lacking. In order to explore such limits, we first develop a tool inferring the best case convergence of SMC methods assuming the true underlying coalescent genealogies are known. This tool can be used to quantify the amount and type of information that can be confidently retrieved from given data sets prior to the analysis of the real data. Second, we assess the inference accuracy when the assumptions of SMC approaches are violated due to departures from the model, namely the presence of transposable elements, variable recombination and mutation rates along the sequence, and SNP calling errors. Third, we deliver a new interpretation of SMC methods by highlighting the importance of the transition matrix, which we argue can be used as a set of summary statistics in other statistical inference methods, uncoupling the SMC from hidden Markov models (HMMs). We finally offer recommendations to better apply SMC methods and build adequate data sets under budget constraints.
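To make the idea of the transition matrix as a set of summary statistics concrete, here is a minimal sketch. It is not the authors' tool. It assumes the coalescence time at each position along the sequence is already known (the best-case setting the abstract describes), discretizes those times into hidden states, and counts transitions between states at adjacent positions. The function names, the time-bin boundaries, and the toy input are all hypothetical choices made for illustration.

```python
from bisect import bisect_right


def discretize(times, bin_edges):
    """Map each coalescence time to a state index.

    bin_edges are the interior boundaries between time intervals
    (an assumption of this sketch), so there are len(bin_edges) + 1 states.
    """
    return [bisect_right(bin_edges, t) for t in times]


def transition_counts(states, n_states):
    """Count transitions between the states at adjacent positions."""
    counts = [[0] * n_states for _ in range(n_states)]
    for current, following in zip(states, states[1:]):
        counts[current][following] += 1
    return counts


def row_normalize(counts):
    """Turn counts into empirical transition probabilities, row by row.

    Rows with no observations are left as zeros.
    """
    probs = []
    for row in counts:
        total = sum(row)
        probs.append([c / total if total else 0.0 for c in row])
    return probs


# Toy input: hypothetical coalescence times at six consecutive positions.
times = [0.2, 0.2, 0.7, 1.5, 1.5, 0.3]
bin_edges = [0.5, 1.0]  # three states: [0, 0.5), [0.5, 1.0), [1.0, inf)

states = discretize(times, bin_edges)
counts = transition_counts(states, n_states=len(bin_edges) + 1)
probs = row_normalize(counts)
```

The resulting count matrix, or its row-normalized form, is the kind of object that could be passed as summary statistics to another inference method; how such a matrix is actually built and used in the article is described in the full text, not here.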

Article activity feed

Version published to 10.1111/1755-0998.13416
May 30, 2021
Peer Community in Evolutionary Biology
Nov 12, 2020

Version published to 10.1101/2020.07.23.217091v3 on bioRxiv
Nov 10, 2020
Version published to 10.1101/2020.07.23.217091v2 on bioRxiv
Sep 21, 2020
Version published to 10.1101/2020.07.23.217091v1 on bioRxiv
Jul 24, 2020

Related articles

Coalescence and Translation: A Language Model for Population Genetics

This article has 5 authors:
1. Kevin Korfmann
2. Nathaniel S. Pope
3. Melinda Meleghy
4. Aurélien Tellier
5. Andrew D. Kern
This article has no evaluations. Latest version Jun 27, 2025

Can ancient DNA and other forms of time-sampled data aid in the inference of negative frequency dependent selection?

This article has 1 author:
1. Vivak Soni
This article has no evaluations. Latest version May 24, 2025

Improving the Scalability of Bayesian Phylodynamic Inference through Efficient MCMC Proposals

This article has 4 authors:
1. Remco Bouckaert
2. Paula Weidemüller
3. Luis Esquivel Gomez
4. Nicola Felix Müller
This article has no evaluations. Latest version Jun 24, 2025
