The spatiotemporal patterns of major human admixture events during the European Holocene

Curation statements for this article:
  • Curated by eLife

    eLife logo

    Evaluation Summary:

    This manuscript presents DATES, a method to infer the timing of admixture events using genetic data from present-day or ancient individuals. This is a robust method that is useful in the field of paleogenomics and outperforms existing methods. In this manuscript, DATES is applied to >1000 ancient human genomes to characterize major admixture events during the European Holocene. This work will be of interest to scholars in the fields of population genetics, paleogenomics, archeology, biological anthropology, and history.

    (This preprint has been reviewed by eLife. We include the public reviews from the reviewers here; the authors also receive private feedback with suggested changes to the manuscript. The reviewers remained anonymous to the authors.)

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

Recent studies have shown that admixture has been pervasive throughout human history. While several methods exist for dating admixture in contemporary populations, they are not suitable for sparse, low coverage ancient genomic data. Thus, we developed DATES (Distribution of Ancestry Tracts of Evolutionary Signals) that leverages ancestry covariance patterns across the genome of a single individual to infer the timing of admixture. DATES provides reliable estimates under various demographic scenarios and outperforms available methods for ancient DNA applications. Using DATES on~1100 ancient genomes from sixteen regions in Europe and west Asia, we reconstruct the chronology of the formation of the ancestral populations and the fine-scale details of the spread of Neolithic farming and Steppe pastoralist-related ancestry across Europe. By studying the genetic formation of Anatolian farmers, we infer that gene flow related to Iranian Neolithic farmers occurred before 9600 BCE, predating the advent of agriculture in Anatolia. Contrary to the archaeological evidence, we estimate that early Steppe pastoralist groups (Yamnaya and Afanasievo) were genetically formed more than a millennium before the start of Steppe pastoralism. Our analyses provide new insights on the origins and spread of farming and Indo-European languages, highlighting the power of genomic dating methods to elucidate the legacy of human migrations.

Article activity feed

  1. Evaluation Summary:

    This manuscript presents DATES, a method to infer the timing of admixture events using genetic data from present-day or ancient individuals. This is a robust method that is useful in the field of paleogenomics and outperforms existing methods. In this manuscript, DATES is applied to >1000 ancient human genomes to characterize major admixture events during the European Holocene. This work will be of interest to scholars in the fields of population genetics, paleogenomics, archeology, biological anthropology, and history.

    (This preprint has been reviewed by eLife. We include the public reviews from the reviewers here; the authors also receive private feedback with suggested changes to the manuscript. The reviewers remained anonymous to the authors.)

  2. Reviewer #1 (Public Review):

    Chintalapati et al. present DATES, a method that leverages ancestry covariance patterns across the genome of a single individual to infer the timing of admixture events. Authors perform simulations of demographic models that mimic key events in European prehistory, as well as the characteristics of ancient DNA (aDNA) datasets (e.g., pseudo-haploid calls and a large proportion of missing data). Based on these simulations, the authors assess the performance of DATES across a range of admixture proportions, timeframes, the proportion of missing data, and different numbers of populations as surrogate parental groups. One of the major strengths of this manuscript is that it presents a very robust method that is broadly useful in paleogenomic studies and that fills an important gap in the field (previous LD- and haplotype-based methods do not perform as well with aDNA datasets).

    The method is applied to >1000 ancient human genomes (publicly available) to characterize major admixture events during the European Holocene. The manuscript addresses long-standing questions in the field, for instance, the genetic formation of Anatolian farmers and Steppe pastoralists, the chronology of the neolithization of Europe, and the spread of Steppe ancestry across Europe, among others. The authors do a very good job in synthesizing known and new observations into a comprehensive and exciting story - this is particularly impressive given the breadth of events that this work covers.

    The authors' claims are fully justified by their results. The manuscript is very easy to read, and the results are presented in a very clear way.

  3. Reviewer #2 (Public Review):

    Chintalapati et al. present an updated and refined version of the software DATES, for estimating the number of generations since admixture in either a single diploid individual or a pool of admixed samples. They show via extensive simulations that DATES performs well under many different individual conditions; including small samples sizes, missing data or pseudo-haploid genotype calling. The authors then systematically applied DATES to a large dataset of published ancient human SNP capture data, to investigate the timings of admixture between populations in Holocene West Eurasia.

    Through simulations, the authors convincingly demonstrate that DATES outperforms comparable methods, and produces accurate inference under many challenging circumstances. However, the strength of the empirical analyses is undermined by the lack of simulations which explicitly test the performance of DATES under the combination of conditions present in the empirical data. Many of the empirical analyses contain all of the problematic features that are only tested in isolation in the simulations-e.g., small sample sizes in both the target and reference populations, with non-contemporaneous sampling, variable data missingness, and pseudo-haploid genotypes. For example, in Figure 1b the authors show the effects of pseudo-haploid genotypes and data missingness for a target population of size n=10; however, this is substantially larger than the average target population used in the empirical analyses, and does not test the combined effects of small sample sizes and poor data quality in the reference populations. The authors also show that the performance of DATES is sensitive to low Fst between the admixing populations, but do not provide simulations calibrated to the levels of Fst between the empirical reference populations. These issues make it difficult to interpret the robustness of the empirical analyses, despite the large number of simulations presented.

    Overall, the paper is well written, and the authors do a good job of presenting a complex series of results. The software DATES represents a significant improvement over comparable methods and has the potential to make substantial contributions to the study of admixture history in many species.