Duphold: scalable, depth-based annotation and curation of high-confidence structural variant calls

Brent S Pedersen
Aaron R Quinlan

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (GigaScience)

Abstract

Most structural variant (SV) detection methods use clusters of discordant read-pair and split-read alignments to identify variants yet do not integrate depth of sequence coverage as an additional means to support or refute putative events. Here, we present "duphold," a new method to efficiently annotate SV calls with sequence depth information that can add (or remove) confidence to SVs that are predicted to affect copy number. Duphold indicates not only the change in depth across the event but also the presence of a rapid change in depth relative to the regions surrounding the break-points. It uses a unique algorithm that allows the run time to be nearly independent of the number of variants. This performance is important for large, jointly called projects with many samples, each of which must be evaluated at thousands of sites. We show that filtering on duphold annotations can greatly improve the specificity of SV calls. Duphold can annotate SV predictions made from both short-read and long-read sequencing datasets. It is available under the MIT license at https://github.com/brentp/duphold.

GigaScience
Jan 23, 2022

Now published in GigaScience doi: 10.1093/gigascience/giz040

Brent S. Pedersen 1Department of Human Genetics, University of Utah. Salt Lake City, UT3USTAR Center for Genetic Discovery, University of Utah. Salt Lake City, UTFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Brent S. PedersenAaron R. Quinlan 1Department of Human Genetics, University of Utah. Salt Lake City, UT2Department of Biomedical Informatics, University of Utah. Salt Lake City, UT3USTAR Center for Genetic Discovery, University of Utah. Salt Lake City, UTFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Aaron R. Quinlan

A version of this preprint has been published in the Open Access journal GigaScience (see paper https://doi.org/10.1093/gigascie…

Now published in GigaScience doi: 10.1093/gigascience/giz040

Brent S. Pedersen 1Department of Human Genetics, University of Utah. Salt Lake City, UT3USTAR Center for Genetic Discovery, University of Utah. Salt Lake City, UTFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Brent S. PedersenAaron R. Quinlan 1Department of Human Genetics, University of Utah. Salt Lake City, UT2Department of Biomedical Informatics, University of Utah. Salt Lake City, UT3USTAR Center for Genetic Discovery, University of Utah. Salt Lake City, UTFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Aaron R. Quinlan

A version of this preprint has been published in the Open Access journal GigaScience (see paper https://doi.org/10.1093/gigascience/giz040 ), where the paper and peer reviews are published openly under a CC-BY 4.0 license.

These peer reviews were as follows:

Reviewer 1: http://dx.doi.org/10.5524/REVIEW.101641 Reviewer 2: http://dx.doi.org/10.5524/REVIEW.101642

Read the original source
Version published to 10.1093/gigascience/giz040
Apr 1, 2019
Version published to 10.1101/465385 on bioRxiv
Nov 8, 2018

META-DIFF: a k-mer-based pipeline that detects differentially abundant sequences in metagenomics whole genome sequencing

This article has 8 authors:
1. Louis-Maël Guéguen
2. Alban Mathieu
3. Simon Pelletier
4. Anthony Woo
5. Namita Misra
6. Magali Moreau
7. Olivier Perin
8. Arnaud Droit
This article has no evaluationsLatest version Jan 29, 2026
Enhancing variant detection in complex genomes: leveraging linked reads for robust SNP, Indel, and structural variant analysis

This article has 7 authors:
1. Can Luo
2. Yichen Liu
3. Han Liu
4. Zhenmiao Zhang
5. Lu Zhang
6. Brock Peters
7. Xin Maizie Zhou
This article has no evaluationsLatest version Jan 12, 2026
GTcomplex: Spatial indexing-powered search and alignment of macromolecular complexes

This article has 1 author:
1. Mindaugas Margelevicius
This article has no evaluationsLatest version Jan 22, 2026

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

META-DIFF: a k-mer-based pipeline that detects differentially abundant sequences in metagenomics whole genome sequencing

Enhancing variant detection in complex genomes: leveraging linked reads for robust SNP, Indel, and structural variant analysis

GTcomplex: Spatial indexing-powered search and alignment of macromolecular complexes