Comparing Neanderthal introgression maps reveals core agreement but substantial heterogeneity

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Statistical methods to identify Neanderthal ancestry in modern human genomes rest on varying assumptions and inputs. Nonetheless, most studies of introgression use only a single method to define Neanderthal ancestry. Due to a lack of “ground truth,” we have a limited understanding of the accuracy, comparative strengths and weaknesses, and the sensitivity of downstream conclusions for these methods. Here, we performed large-scale comparisons of genome-wide introgression maps from 12 representative Neanderthal introgression detection algorithms. These span methods that consider archaic and human reference genomes not from Africa (ArchaicSeeker2, CRF, DICAL-ADMIX), only archaic genomes (S*, Sprime, HMM, SARGE, ARGWeaver-D), only human reference genomes, including from Africa (IBDmix), or simulated data (ArchIE). Our results highlight a core set of regions predicted by nearly all methods, as well as substantial heterogeneity in commonly used Neanderthal introgression maps. Furthermore, we find that downstream analyses may result in different conclusions depending on the map used. Thus, we recommend careful consideration of map(s) chosen for an analysis and support the use of multiple maps to ensure robustness of conclusions. We make integrated prediction sets available, enabling further understanding of Neanderthal introgression’s legacy on modern humans.

Article activity feed