GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data

This article has been Reviewed by the following groups

Read the full article

Abstract

Background

The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS.

Results

Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA).

Conclusions

We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.

Article activity feed

  1. Now published in GigaScience doi: 10.1093/gigascience/giaa005

    Ekaterina Noskova 1ITMO University, St. Petersburg, Russia2JetBrains Research, St. Petersburg, RussiaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteFor correspondence: ekaterina.e.noskova@gmail.comVladimir Ulyantsev 1ITMO University, St. Petersburg, RussiaFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteKlaus-Peter Koepfli 3Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia5National Zoological Park, Smithsonian Conservation Biology Institute, Washington DC, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteStephen J. O’Brien 3Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia4Oceanographic Center, Nova Southeastern University Ft Lauderdale, Ft Lauderdale, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this sitePavel Dobrynin 3Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia5National Zoological Park, Smithsonian Conservation Biology Institute, Washington DC, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this site

    A version of this preprint has been published in the Open Access journal GigaScience (see paper https://doi.org/10.1093/gigascience/giaa005 ), where the paper and peer reviews are published openly under a CC-BY 4.0 license.

    These peer reviews were as follows:

    Reviewer 1: http://dx.doi.org/10.5524/REVIEW.102116 Reviewer 2: http://dx.doi.org/10.5524/REVIEW.102117