Benchmarking of germline copy number variant callers from whole genome sequencing data for clinical applications

Francisco M De La Vega
Sean A Irvine
Pavana Anur
Kelly Potts
Lewis Kraft
Raul Torres
Peter Kang
Sean Truong
Yeonghun Lee
Shunhua Han
Vitor Onuchic
James Han


Abstract

Motivation

Whole-genome sequencing (WGS) is increasingly preferred for clinical applications due to its comprehensive coverage, effectiveness in detecting copy number variants (CNVs), and declining costs. However, systematic evaluations of WGS CNV callers tailored to germline clinical testing—where high sensitivity and confirmation of reported CNVs are essential—remain necessary. Clinical reporting typically emphasizes CNVs affecting coding regions over precise breakpoint detection. This study benchmarks several short-read WGS CNV detection tools using reference cell lines to inform their clinical use.

Results

While tools vary in sensitivity (7%–83%) and precision (1%–76%), few meet the sensitivity needed for clinical testing. Callers generally perform better for deletions (up to 88% sensitivity) than duplications (up to 47% sensitivity), with poor detection of duplications under 5 kb. Notably, for CNVs in genes commonly included in clinical panels, significantly improved sensitivity and precision were observed when benchmarking against 25 cell lines with known CNVs. DRAGEN v4.2 high-sensitivity CNV calls, post-processed with custom filters, achieved 100% sensitivity and 77% precision on the optimized gene panel after excluding recurring artifacts. This level of performance may support clinical use with orthogonal confirmation of reportable CNVs, pending validation on laboratory-specific samples.
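The sensitivity and precision figures above can be made concrete with a small sketch. This is not the authors' benchmarking pipeline: it assumes a simple matching rule (a call matches a truth CNV when both are in the same sample, on the same chromosome, of the same type, and overlap by at least one base), which reflects the emphasis on affected regions over exact breakpoints. The `CNV` record, the `benchmark` function, and the sample and coordinate values are all hypothetical.

```python
from typing import NamedTuple, Sequence, Tuple


class CNV(NamedTuple):
    sample: str
    chrom: str
    start: int  # 0-based, half-open interval [start, end)
    end: int
    kind: str   # "DEL" or "DUP"


def overlaps(a: CNV, b: CNV) -> bool:
    """Assumed matching rule: same sample, chromosome, and type, any overlap."""
    return (
        a.sample == b.sample
        and a.chrom == b.chrom
        and a.kind == b.kind
        and a.start < b.end
        and b.start < a.end
    )


def benchmark(truth: Sequence[CNV], calls: Sequence[CNV]) -> Tuple[float, float]:
    """Return (sensitivity, precision) under the overlap rule above."""
    # Sensitivity: fraction of truth CNVs recovered by at least one call.
    detected = sum(any(overlaps(t, c) for c in calls) for t in truth)
    # Precision: fraction of calls supported by at least one truth CNV.
    supported = sum(any(overlaps(c, t) for t in truth) for c in calls)
    sensitivity = detected / len(truth) if truth else float("nan")
    precision = supported / len(calls) if calls else float("nan")
    return sensitivity, precision


# Hypothetical toy data, for illustration only.
truth = [
    CNV("S1", "chr1", 1000, 5000, "DEL"),
    CNV("S1", "chr2", 20000, 24000, "DUP"),
]
calls = [
    CNV("S1", "chr1", 1200, 4800, "DEL"),    # matches the first truth CNV
    CNV("S1", "chr2", 20500, 23000, "DEL"),  # wrong type, so no match
    CNV("S1", "chr3", 100, 900, "DUP"),      # no truth CNV here
]
sensitivity, precision = benchmark(truth, calls)
print(f"sensitivity={sensitivity:.2f} precision={precision:.2f}")
```

A stricter rule, such as a minimum reciprocal overlap or a requirement that the same coding exons be affected, would lower both figures; the choice of rule is part of what any such benchmark has to state.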

Availability and implementation

The data underlying this article are available in the European Nucleotide Archive under project accession PRJEB87628.

Version published to 10.1093/bioadv/vbaf071
Dec 26, 2024
Version published to 10.1101/2024.07.12.24310338 on medRxiv
Jul 17, 2024

