KLinterSel: Intersection among candidates of different selective sweep detection methods

Antonio Carvajal-Rodríguez
Sara Rocha
Marina Pampín
Paulino Martínez
Armando Caballero

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Studies aiming to detect signals of selection in genomes often apply multiple methods to increase confidence in their results, typically selecting genomic regions that overlap across approaches. However, such overlap can be misleading when the genomic regions under study are not independent. In these cases, coincident candidates may arise from the structure of the data itself rather than from true methodological robustness. To address this issue, we present a statistical test that compares, for a given set of SNPs, the observed distance profile between candidate sites detected by different methods with the distance profile expected by chance for the same dataset. This test is implemented in the KLinterSel program, which additionally identifies clusters of sites jointly detected by several methods within a user-defined distance threshold. As a proof of concept, we applied KLinterSel to evaluate the overlap among candidates from four selection-detection methods investigating divergent selection associated with resistance to the parasite Marteilia cochillia in the common cockle ( Cerastoderma edule ). KLinterSel statistically evaluates and visualizes the agreement between observed and expected-by-chance distance profiles. It uses Python’s numerical libraries and vectorized operations for computational efficiency and includes multi-process parallelization options for memory-intensive datasets. Source code and documentation are available on GitHub ( https://github.com/noosdev0/KLinterSel ), and pre-built binaries for Windows, Linux, and macOS (arm64) facilitate broad accessibility.

Version published to 10.1101/2025.08.21.671449 on bioRxiv
Aug 25, 2025

Genetic estimates of relatedness: Established practices and new opportunities through low coverage whole genome sequencing

This article has 8 authors:
1. Annika Freudiger
2. Natalie Kestel
3. Vladimir Jovanovic
4. Mariana Madruga de Brito
5. Angelina Ruiz-Lambides
6. Katja Nowick
7. Anja Widdig
8. Harald Ringbauer
This article has no evaluationsLatest version Jan 23, 2026
Comparison of BLUPF90IOD3 and MiXBLUP implementations of the single-step model applied to the Polish national dairy cattle evaluation

This article has 4 authors:
1. Dawid Słomian
2. Michalina Jakimowicz
3. Tomasz Suchocki
4. Joanna Szyda
This article has no evaluationsLatest version Dec 22, 2025
GenBlosum: On Determining Whether Cancer Mutations Are Functional or Random

This article has 2 authors:
1. Alejandro Leyva
2. Muhammad Khalid Khan Niazi
This article has no evaluationsLatest version Dec 15, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Genetic estimates of relatedness: Established practices and new opportunities through low coverage whole genome sequencing

Comparison of BLUPF90IOD3 and MiXBLUP implementations of the single-step model applied to the Polish national dairy cattle evaluation

GenBlosum: On Determining Whether Cancer Mutations Are Functional or Random