Structural variation in 1,019 diverse humans based on long-read sequencing

Siegfried Schloissnig
Samarendra Pani
Jana Ebler
Carsten Hain
Vasiliki Tsapalou
Arda Söylev
Patrick Hüther
Hufsah Ashraf
Timofey Prodanov
Mila Asparuhova
Hugo Magalhães
Wolfram Höps
Jesus Emiliano Sotelo-Fonseca
Tomas Fitzgerald
Walter Santana-Garcia
Ricardo Moreira-Pinhal
Sarah Hunt
Francy J. Pérez-Llanos
Tassilo Erik Wollenweber
Sugirthan Sivalingam
Dagmar Wieczorek
Mario Cáceres
Christian Gilissen
Ewan Birney
Zhihao Ding
Jan Nygaard Jensen
Nikhil Podduturi
Jan Stutzki
Bernardo Rodriguez-Martin
Tobias Rausch
Tobias Marschall
Jan O. Korbel

Abstract

Genomic structural variants (SVs) contribute substantially to genetic diversity and human diseases¹⁻⁴, yet remain under-characterized in population-scale cohorts⁵. Here we conducted long-read sequencing⁶ in 1,019 humans to construct an intermediate-coverage resource covering 26 populations from the 1000 Genomes Project. Integrating linear and graph genome-based analyses, we uncover over 100,000 sequence-resolved biallelic SVs and we genotype 300,000 multiallelic variable number of tandem repeats⁷, advancing SV characterization over short-read-based population-scale surveys³,⁴. We characterize deletions, duplications, insertions and inversions in distinct populations. Long interspersed nuclear element-1 (L1) and SINE-VNTR-Alu (SVA) retrotransposition activities mediate the transduction⁸,⁹ of unique sequence stretches in 5′ or 3′, depending on source mobile element class and locus. SV breakpoint analyses point to a spectrum of homology-mediated processes contributing to SV formation and recurrent deletion events. Our open-access resource underscores the value of long-read sequencing in advancing SV characterization and enables guiding variant prioritization in patient genomes.

Version published to 10.1038/s41586-025-09290-7
Jul 23, 2025
Version published to 10.1101/2024.04.18.590093 on bioRxiv
Apr 20, 2024

Related articles

Enhancing variant detection in complex genomes: leveraging linked reads for robust SNP, Indel, and structural variant analysis

This article has 7 authors:
1. Can Luo
2. Yichen Liu
3. Han Liu
4. Zhenmiao Zhang
5. Lu Zhang
6. Brock Peters
7. Xin Maizie Zhou
This article has no evaluations. Latest version: Jan 12, 2026

Shotgun metagenomics: a deep insight into the composition and function of the complex microbial world

This article has 7 authors:
1. Grazia Visci
2. Elisabetta Notario
3. Giuseppe Defazio
4. Mariano Francesco Caratozzolo
5. Bruno Fosso
6. Marinella Marzano
7. Graziano Pesole
This article has no evaluations. Latest version: Jan 30, 2026

A Benchmarking Framework to Catalyze Individual Human Genome Projects

This article has 3 authors:
1. Manjushri Kalpande
2. Apoorva Ganesh
3. Subhashini Srinivasan
This article has no evaluations. Latest version: Dec 17, 2025
