Genomic Surveillance of SARS-CoV-2 in Erie County, New York

Abstract

Early in the SARS-CoV-2 pandemic, we established a whole genome sequencing pipeline to assess lineages circulating in Western New York. Initial sequences revealed entry into the region via Europe, similar to observations in New York City. However, as the pandemic progressed and variants of concern emerged, we observed distinct patterns in lineages relative to NYC. Notably, B.1.427 became dominant in Western New York before it was displaced by B.1.1.7. Our hierarchical cluster analysis of B.1.1.7 lineages, which by May 2021 made up ~80% of all cases, indicated both multiple introductions and community spread. Our work highlights the importance of widespread, regional surveillance of SARS-CoV-2 across the United States.

SciScore for 10.1101/2021.07.01.21259869:

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Ethics	IRB: This study was reviewed by the University at Buffalo Institutional Review Board and determined to be “Not Human Research” (IRB ID: STUDY00004515).
Sex as a biological variable	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
Denatured libraries were diluted to a final concentration in Illumina HT1 buffer (12.5 pM for MiSeq and 1.5 pM for NextSeq).	MiSeq suggested: (A5-miseq, RRID:SCR_012148)
Upon completion of the sequencing run, data were transferred to the high-performance computing facility (Center for Computational Research) located in the Center of Excellence building at the University at Buffalo. UB GBC SARS-CoV-2 Bioinformatics Analysis: The GBC SARS-CoV-2 analysis pipeline (https://github.com/UBGBC/fastq-to-consensus) is modeled on the recommendations provided by the CDC SARS-CoV-2 SPHERES working group (https://github.com/CDCgov/SARS-CoV-2_Sequencing) and is written in the Python pipeline framework Snakemake.	python suggested: (IPython, RRID:SCR_001658)
Then, reads are checked for initial quality using fastqc, fastq_screen, and multiqc, prior to adapter removal analysis via the tool Cutadapt.	Cutadapt suggested: (cutadapt, RRID:SCR_011841)
To detect inter-lineage variation, we compared each sample’s spread of variants utilizing the Bedtools jaccard function, which generates a Jaccard index score between every pair of samples.	Bedtools suggested: (BEDTools, RRID:SCR_006646)
The resulting similarity matrix was then used as input for hierarchical cluster analysis in RStudio.	RStudio suggested: (RStudio, RRID:SCR_000432)
The resulting alignment was then used as input into the FastTree algorithm (Price, 2009; Price, 2010), inferring maximum-likelihood phylogeny using the Jukes-Cantor distance model of nucleotide evolution, generating a Newick-formatted phylogenetic tree.	FastTree suggested: (FastTree, RRID:SCR_015501)
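
The sentences above describe scoring every pair of samples with a Jaccard index and feeding the resulting similarity matrix into hierarchical cluster analysis. The sketch below illustrates that idea only; it is not the authors' pipeline. It assumes each sample is reduced to a set of variant labels, whereas the Bedtools jaccard function works on genomic intervals, and it uses a plain average-linkage merge in place of whatever clustering settings were used in RStudio. The sample names, variant labels, the 0.5 cutoff, and the function names are all hypothetical.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; two empty sets count as identical."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


def similarity_matrix(samples: dict) -> dict:
    """Jaccard index for every unordered pair of sample names."""
    return {
        frozenset((s, t)): jaccard(samples[s], samples[t])
        for s, t in combinations(samples, 2)
    }


def average_linkage(samples: dict, threshold: float) -> list:
    """Agglomerative clustering on distance = 1 - Jaccard.

    The two closest clusters are merged repeatedly until the closest
    remaining pair is farther apart than `threshold` (an assumed cutoff).
    """
    sim = similarity_matrix(samples)
    clusters = [frozenset([name]) for name in samples]

    def dist(c1, c2):
        # Average of all pairwise distances between members of c1 and c2.
        pairs = [(x, y) for x in c1 for y in c2]
        return sum(1.0 - sim[frozenset(p)] for p in pairs) / len(pairs)

    while len(clusters) > 1:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: dist(clusters[ij[0]], clusters[ij[1]]),
        )
        if dist(clusters[i], clusters[j]) > threshold:
            break
        merged = clusters[i] | clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters


# Toy variant calls written as "position:alt"; illustrative, not study data.
samples = {
    "S1": {"241:T", "3037:T", "14408:T", "23403:G"},
    "S2": {"241:T", "3037:T", "14408:T", "23403:G", "23063:T"},
    "S3": {"8782:T", "28144:C"},
    "S4": {"8782:T", "28144:C", "18060:T"},
}

clusters = average_linkage(samples, threshold=0.5)
for cluster in clusters:
    print(sorted(cluster))
```

With real data, the pairwise scores would come from the interval-based Bedtools output rather than from set overlap, and a full dendrogram (rather than a single cutoff) would usually be inspected before deciding how many groups to report.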

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Results from scite Reference Check: We found no unreliable references.

Related articles

Genomic characterization of SARS-CoV-2 variants circulating in the population of Bangui, Central African Republic (CAR) in 2022.

Overview of SARS-CoV-2 Genomic Surveillance in Central America and the Dominican Republic from February 2020 to January 2023: The Impact of PAHO and COMISCA's Collaborative Efforts

DIVERSITY AND CLINICAL CORRELATIONS OF SARS-CoV-2 VARIANT DURING THE INTRODUCTION OF THE DELTA VARIANT IN GUATEMALA