The signal of admixture can decay rapidly when using clustering-based methods
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Gene flow shapes evolutionary trajectories by introducing novel alleles, facilitating or retarding adaptation, or eroding divergence among populations. Studies commonly infer gene flow through estimates of genetic admixture from methods that cluster individuals by allele-frequency similarity. However, the ability of these methods to reliably detect historical admixture remains poorly understood, particularly under spatially restricted dispersal and across a range of migration rates. Here we evaluate how signals of admixture arise and decay during and after gene flow across a range of within- and between-population dispersal rates, using three common inference methods: ADMIXTURE, sNMF, and PopCluster. Using forward-time simulations in a two-dimensional landscape, we modelled two metapopulations that diverged in isolation, experienced a pulse of secondary contact, and subsequently returned to isolation. All three methods systematically underestimated admixture, with the bias increasing once gene flow ceased: inferred admixture averaged 56% of the true value at the end of gene flow, declined to 23% after 1,000 generations, and fell below 5% by 2,000 generations, even where true admixture exceeded 20%. This underestimation was accompanied by corresponding overestimation of barrier strength, assessed using a metric analogous to cline width. Both patterns reflect the rapid re-establishment of Hardy–Weinberg and linkage equilibrium following secondary contact, which erodes the allele-frequency differences upon which these methods depend. Consequently, the absence of inferred admixture in present-day samples should not be taken as evidence against historical gene flow.