Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows

Ryan Connor
Migun Shakya
David A. Yarmosh
Wolfgang Maier
Ross Martin
Rebecca Bradford
J. Rodney Brister
Patrick S. G. Chain
Courtney A. Copeland
Julia di Iulio
Bin Hu
Philip Ebert
Jonathan Gunti
Yumi Jin
Kenneth S. Katz
Andrey Kochergin
Tré LaRosa
Jiani Li
Po-E Li
Chien-Chi Lo
Sujatha Rashid
Evguenia S. Maiorova
Chunlin Xiao
Vadim Zalunin
Lisa Purcell
Kim D. Pruitt

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (Arcadia Science)

Abstract

Genomic sequencing of clinical samples to identify emerging variants of SARS-CoV-2 has been a key public health tool for curbing the spread of the virus. As a result, an unprecedented number of SARS-CoV-2 genomes were sequenced during the COVID-19 pandemic, which allowed for rapid identification of genetic variants, enabling the timely design and testing of therapies and deployment of new vaccine formulations to combat the new variants. However, despite the technological advances of deep sequencing, the analysis of the raw sequence data generated globally is neither standardized nor consistent, leading to vastly disparate sequences that may impact identification of variants. Here, we show that for both Illumina and Oxford Nanopore sequencing platforms, downstream bioinformatic protocols used by industry, government, and academic groups resulted in different virus sequences from same sample. These bioinformatic workflows produced consensus genomes with differences in single nucleotide polymorphisms, inclusion and exclusion of insertions, and/or deletions, despite using the same raw sequence as input datasets. Here, we compared and characterized such discrepancies and propose a specific suite of parameters and protocols that should be adopted across the field. Consistent results from bioinformatic workflows are fundamental to SARS-CoV-2 and future pathogen surveillance efforts, including pandemic preparation, to allow for a data-driven and timely public health response.

Version published to 10.3390/v16030430
Mar 11, 2024
Arcadia Science
Nov 4, 2022

We introduce the use of a specific suite of parameters and protocols that greatly improves the agreement among pipelines developed by diverse organizations.

From reading the paper it seems like this was done on samples where the SARS-COV-2 in the sample is likely to be genetically uniform. Have you done this at all on samples such as those from wastewater or sewage or environmental swabs where there are likely to be many different variants within a single sample?

Read the original source
Version published to 10.1101/2022.11.03.515010 on bioRxiv
Nov 3, 2022

Rapid Phylogenomic Analysis of Thousands Outbreak‐Causing Viral Genomes Using Covary

This article has 1 author:
1. Marvin I. De los Santos
This article has no evaluationsLatest version Dec 22, 2025
One Health Viral Metagenomics for Pandemic Preparedness: Validated mNGS Workflows for Viral Detection and Genome Recovery from Swab and Tissue Specimens

This article has 14 authors:
1. Tristan Russell
2. Elisa Formiconi
3. Alison Murphy
4. Jimmy Hortion
5. Máire McElroy
6. Mícheál Casey
7. Laura Garza Cuartero
8. John F Mee
9. Hanne Jahns
10. Christine Kelly
11. Joanne Byrne
12. Eoin R Feeney
13. Patrick WG Mallon
14. Virginie W Gautier
This article has no evaluationsLatest version Jan 16, 2026
Harnessing Genomics for Public Health: Use-Case Insights from Uganda

This article has 32 authors:
1. Aloysious Ssemaganda
2. Alisen Ayitewala
3. Stephen Kanyerezi
4. Hellen Rosette Oundo
5. Julius Seruyange
6. Wilson Tenywa
7. Godwin Tusabe
8. Stacy Were
9. Moses Murungi
10. Ivan Sserwadda
11. Shahiid Kiyaga
12. Jupiter Marina Kabahita
13. Caroline Makoha
14. Stephen Tukwasibwe
15. Thomas Katairo
16. Andrew Nsawotebba
17. Bosco B. Agaba
18. Collins Kipngetich Tanui
19. Harris Onywera
20. Gerald Mboowa
21. Sofonias K. Tessema
22. Henry Kyobe Bosa
23. Ritah Namusoosa
24. Ibrahim Mugerwa
25. Saudah Namubiru
26. Grace Najjuka
27. Caroline Achola
28. Charles Olaro
29. Diana Atwine
30. Jane Ruth Aceng
31. Susan Nabadda
32. Isaac Ssewanyana
This article has no evaluationsLatest version Dec 16, 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Rapid Phylogenomic Analysis of Thousands Outbreak‐Causing Viral Genomes Using Covary

One Health Viral Metagenomics for Pandemic Preparedness: Validated mNGS Workflows for Viral Detection and Genome Recovery from Swab and Tissue Specimens

Harnessing Genomics for Public Health: Use-Case Insights from Uganda