The mutation rate of SARS-CoV-2 is highly variable between sites and is influenced by sequence context, genomic region, and RNA structure

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

RNA viruses like SARS-CoV-2 have a high mutation rate, which contributes to their rapid evolution. The rate of mutations depends on the mutation type (e.g., A → C , A → G , etc.) and can vary between sites in the viral genome. Understanding this variation can shed light on the mutational processes at play, and is crucial for quantitative modeling of viral evolution. Using the millions of available SARS-CoV-2 full-genome sequences, we estimate rates of synonymous mutations for all 12 possible nucleotide mutation types and examine how much these rates vary between sites. We find a surprisingly high level of variability and several striking patterns: the rates of four mutation types suddenly increase at one of two gene boundaries; the rates of most mutation types strongly depend on a site’s local sequence context, with up to 56-fold differences between contexts; consistent with a previous study, the rates of some mutation types are lower at sites engaged in RNA secondary structure. A simple log-linear model of these features explains ∼15-60% of the fold-variation of mutation rates between sites, depending on mutation type; more complex models only modestly improve predictive power out of sample. We estimate the fitness effect of each mutation based on the number of times it actually occurs versus the number of times it is expected to occur based on the model. We identify several small regions of the genome where synonymous or noncoding mutations occur much less often than expected, indicative of strong purifying selection on the RNA sequence that is independent of protein sequence. Overall, this work expands our basic understanding of SARS-CoV-2’s evolution by characterizing the virus’s mutation process at the level of individual sites and uncovering several striking mutational patterns that arise from unknown mechanisms.

Article activity feed