Tspecies , Rapid Optimization for Estimating Species Divergence Time Using K s Distribution
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
K s distribution, the distribution of the synonymous substitutions, has been widely used to estimate the species divergence using orthologous genes. However, conventional approaches often ignore the underlying bias that species divergence is delayed to average gene divergence by 2 N e generations, where N e represents the ancestral effective population size, due to the lack of scalable methods for N e inference. Here, we demonstrate through simulations that K s distribution variance correlates with N e , enabling direct estimation of ancestral population parameters from standard K s data. Leveraging this relationship, we present Tspecies , an R package that corrects divergence time estimates using only substitution rates and K s distributions, without requiring additional genomic data. Our practical application of Tspecies in Liriodendron has inferred a divergence time between North American and East Asian lineages (3.49 Ma) that align with late Pliocene cooling, and a large ancestral N e (∼5 × 10 5 ) consistent with fossil evidence. By incorporating a readily estimated N e , our tool resolves a long-standing bias in K s -based dating while maintaining computational efficiency and broad applicability.
Tspecies is freely available under an MIT license at https://github.com/limj0987/Tspecies.git .