KLinterSel: Intersection among candidates of different selective sweep detection methods
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Studies aiming to detect signals of selection in genomes often apply multiple methods to increase confidence in their results, typically selecting genomic regions that overlap across approaches. However, such overlap can be misleading when the genomic regions under study are not independent. In these cases, coincident candidates may arise from the structure of the data itself rather than from true methodological robustness. To address this issue, we present a statistical test that compares, for a given set of SNPs, the observed distance profile between candidate sites detected by different methods with the distance profile expected by chance for the same dataset. This test is implemented in the KLinterSel program, which additionally identifies clusters of sites jointly detected by several methods within a user-defined distance threshold. As a proof of concept, we applied KLinterSel to evaluate the overlap among candidates from four selection-detection methods investigating divergent selection associated with resistance to the parasite Marteilia cochillia in the common cockle ( Cerastoderma edule ). KLinterSel statistically evaluates and visualizes the agreement between observed and expected-by-chance distance profiles. It uses Python’s numerical libraries and vectorized operations for computational efficiency and includes multi-process parallelization options for memory-intensive datasets. Source code and documentation are available on GitHub ( https://github.com/noosdev0/KLinterSel ), and pre-built binaries for Windows, Linux, and macOS (arm64) facilitate broad accessibility.