Benchmarking scRNA-seq Copy Number Inference: A Comprehensive Evaluation and Practitioner’s Guide

Hung-Ching Chang
Yuxin Shi
Haoyu Cheng
Jian Zou
Alexander Chih-Chieh Chang
Brent T. Schlegel
Wenjia Wang
Daniel D. Brown
Fangyuan Chen
Sarah Wang
Danyang Li
Ria Sai
Noelle Michel
Steffi Oesterreich
Adrian V. Lee
George C. Tseng

Abstract

Accurately inferring copy number variation (CNV) from scRNA-seq data is critical for identifying malignant cells, reconstructing tumor subclonal architecture, and uncovering the genomic drivers that dictate cancer cell biology. However, the performance of existing tools varies significantly, and current benchmarks lack the breadth of datasets and methods necessary to provide definitive guidance. We present a comprehensive benchmark of 12 CNV inference methods across 28 real datasets (>100,000 cells) and diverse synthetic datasets. By evaluating methods based on malignant cell classification accuracy, CNV inference accuracy, scalability, and robustness, we establish a definitive practitioner’s guideline: allele-aware methods like Numbat excel when high-quality allelic inference can be achieved, whereas expression-centric tools such as Clonalscope, CopyKAT, inferCNV, and SCEVAN remain reliable when raw sequencing data are unavailable. Our study provides both a practical decision-making framework for researchers and a public repository of standardized CNV profiles to catalyze further methodological innovation.

Version published to 10.64898/2026.04.12.718050 on bioRxiv
Apr 15, 2026

Related articles

CANCAN: high-resolution copy number and mutation heterogeneity analysis of DNA sequence data for clinical applications

This article has 14 authors:
1. Arne V Pladsen
2. Daniel Vodak
3. Sen Zhao
4. Sigve Nakken
5. Daniel Nebdal
6. Tonje Lien
7. Britina Kjuul Danielsen
8. Caroline Wang
9. Wanja Kildal
10. Geir Olav Hjortland
11. Olav Engebråten
12. Eivind Hovig
13. Hege G Russnes
14. Ole Christian Lingjærde
This article has no evaluations. Latest version: May 19, 2026

MCNV2 (Mendelian CNV Validation): Mendelian Precision for CNV quality assessment

This article has 9 authors:
1. Mame Seynabou Diop
2. Audrey Lemacon
3. Kuldeep Kumar
4. Benjamin Clark
5. Guillaume Huguet
6. Florian Bénitière
7. Martineau Jean-Louis
8. Sylvie Hamel
9. Sébastien Jacquemont
This article has no evaluations. Latest version: May 3, 2026

MethylBench: A comprehensive benchmark of DNA methylation profiling methods across diverse sequencing platforms

This article has 13 authors:
1. Lukas Laufer
2. Gilles Gasparoni
3. Thomas Hentrich
4. Linda Sofan
5. Jakob Admard
6. Elena Buena-Atienza
7. Michaela Pogoda
8. Stephan Ossowski
9. Nicolas Casadei
10. Olaf Rieß
11. Tobias B. Haack
12. Rebecca Buchert
13. Julia Schulze-Hentrich
This article has no evaluations. Latest version: Apr 30, 2026
