Identifying SARS-CoV-2 regional introductions and transmission clusters in real time

Jakob McBroome
Jennifer Martin
Adriano de Bernardi Schneider
Yatish Turakhia
Russell Corbett-Detig

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

The unprecedented SARS-CoV-2 global sequencing effort has suffered from an analytical bottleneck. Many existing methods for phylogenetic analysis are designed for sparse, static datasets and are too computationally expensive to apply to densely sampled, rapidly expanding datasets when results are needed immediately to inform public health action. For example, public health is often concerned with identifying clusters of closely related samples, but the sheer scale of the data prevents manual inspection and the current computational models are often too expensive in time and resources. Even when results are available, intuitive data exploration tools are of critical importance to effective public health interpretation and action. To help address this need, we present a phylogenetic summary statistic which quickly and efficiently identifies newly introduced strains in a region, resulting clusters of infected individuals, and their putative geographic origins. We show that this approach performs well on simulated data and is congruent with a more sophisticated analysis performed during the pandemic. We also introduce Cluster Tracker ( https://clustertracker.gi.ucsc.edu/ ), a novel interactive web-based tool to facilitate effective and intuitive SARS-CoV-2 geographic data exploration and visualization. Cluster-Tracker is updated daily and automatically identifies and highlights groups of closely related SARS-CoV-2 infections resulting from inter-regional transmission across the United States, streamlining public health tracking of local viral diversity and emerging infection clusters. The combination of these open-source tools will empower detailed investigations of the geographic origins and spread of SARS-CoV-2 and other densely-sampled pathogens.

ScreenIT
Jan 12, 2022
SciScore for 10.1101/2022.01.07.22268918: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.
Table 2: Resources
Software and Algorithms
Sentences Resources
We include Python scripts to create the backend data for the website display, contained in the “data” directory.
Python
suggested: (IPython, RRID:SCR_001658)
Results from OddPub: Thank you for sharing your code and data.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not …
SciScore for 10.1101/2022.01.07.22268918: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.
Table 2: Resources
Software and Algorithms
Sentences Resources
We include Python scripts to create the backend data for the website display, contained in the “data” directory.
Python
suggested: (IPython, RRID:SCR_001658)
Results from OddPub: Thank you for sharing your code and data.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
Results from scite Reference Check: We found no unreliable references.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Read the original source
Version published to 10.1101/2022.01.07.22268918 on medRxiv
Jan 10, 2022
Version published to 10.1093/ve/veac048
Jan 1, 2022

Software and Algorithms
Sentences	Resources
We include Python scripts to create the backend data for the website display, contained in the “data” directory.	Python suggested: (IPython, RRID:SCR_001658)

Software and Algorithms
Sentences	Resources
We include Python scripts to create the backend data for the website display, contained in the “data” directory.	Python suggested: (IPython, RRID:SCR_001658)

Evaluating Reference-Independent Pipelines for the Detection of Spreading Organisms in Metagenomic Datasets

This article has 7 authors:
1. N.S. Popov
2. V.V. Panova
3. M. Molchanova
4. S.A. Gurov
5. A.N. Lukashev
6. E.N. Ilina
7. A.I. Manolov
This article has no evaluationsLatest version May 6, 2026
Rapid phylogenomic analysis for viral surveillance and metagenomic profiling with Omni2Tree

This article has 9 authors:
1. Sina Majidian
2. Adrian Chalco
3. Xinchang Zheng
4. Richard J Webby
5. Andrew S Bowman
6. Rebecca L Poulson
7. Nicole M Nemeth
8. Fritz J Sedlazeck
9. Daniel P Agustinho
Reviewed by Rapid Reviews Infectious Diseases

This article has 3 evaluationsAppears in 1 listLatest version May 1, 2026Latest activity Jun 11, 2026
Integrated surveillance resolves Darién paradox of Oropouche virus emergence in Panama’s migration corridor

This article has 69 authors:
1. Xacdiel Rodríguez
2. Juan G. Perez-Jimenez
3. Laura W. Alexander
4. Carlos Lezcano-Coba
5. Josefrancisco Galué
6. Yelissa Juárez
7. Davis Beltrán
8. Darci R. Smith
9. Malik Kadir
10. Danielle W. Ali
11. Rita Corrales
12. Lidimarie Trujillo Rodriguez
13. Ghyssella E. Valdiviezo
14. Quinn K. Thomas
15. Anthony Cicalo
16. Maren Fitzpatrick
17. Andrea Luquette
18. LT. Cameron Sayer
19. Regina Z. Cer
20. Francisco Malagon
21. Ilka Anabel Grajales
22. Luis Felipe Rivera
23. Zuleyka González-R
24. Juan Antioco
25. Ellianys Walters-Valdes
26. Niccolò Meneghello-Ponce
27. Amy Y. Vittor
28. Kiriam Escobar-Lee
29. Aaron Abouganem-Shaw
30. Fátima Rodríguez
31. Eduardo Aguirre
32. Steev Loyola
33. Yeny Tinoco
34. Brechla Moreno
35. María Chen-Germán
36. Sonia Ampuero
37. Adrean Gómez-Angelo
38. Samir Correa-Duarte
39. José Acevedo
40. Blas Ramos
41. Maria Eugenia de Antinori
42. Claudia Gonzalez
43. Oris Chavarria
44. Jessica Gondola
45. Ambar Moreno
46. Celestino Aguilar
47. Pablo Gonzáles
48. Carmela Jackman
49. Hector Cedeño
50. Bernardo Gutiérrez
51. Moritz U.G. Kraemer
52. Victor Saldaña
53. Rodrigo DeAntonio
54. Alexander A Martinez
55. Blas Armién
56. Juan Miguel Pascale
57. Arlene Calvo
58. Mauricio L. Nogueira
59. William M. de Souza
60. Kathryn A. Hanley
61. Nuno R. Faria
62. Ilaria Dorigatti
63. Nikos Vasilakis
64. Christl A. Donnelly
65. Sandra López-Vergès
66. Kimberly A. Bishop-Lilly
67. Carla Mavian
68. Ana I. Bento
69. Jean-Paul Carrera
This article has no evaluationsLatest version Jun 1, 2026

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Evaluating Reference-Independent Pipelines for the Detection of Spreading Organisms in Metagenomic Datasets

Rapid phylogenomic analysis for viral surveillance and metagenomic profiling with Omni2Tree

Integrated surveillance resolves Darién paradox of Oropouche virus emergence in Panama’s migration corridor