Purifying Selection Determines the Short-Term Time Dependency of Evolutionary Rates in SARS-CoV-2 and pH1N1 Influenza

Mahan Ghafari
Louis du Plessis
Jayna Raghwani
Samir Bhatt
Bo Xu
Oliver G Pybus
Aris Katzourakis

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

High-throughput sequencing enables rapid genome sequencing during infectious disease outbreaks and provides an opportunity to quantify the evolutionary dynamics of pathogens in near real-time. One difficulty of undertaking evolutionary analyses over short timescales is the dependency of the inferred evolutionary parameters on the timespan of observation. Crucially, there are an increasing number of molecular clock analyses using external evolutionary rate priors to infer evolutionary parameters. However, it is not clear which rate prior is appropriate for a given time window of observation due to the time-dependent nature of evolutionary rate estimates. Here, we characterize the molecular evolutionary dynamics of SARS-CoV-2 and 2009 pandemic H1N1 (pH1N1) influenza during the first 12 months of their respective pandemics. We use Bayesian phylogenetic methods to estimate the dates of emergence, evolutionary rates, and growth rates of SARS-CoV-2 and pH1N1 over time and investigate how varying sampling window and data set sizes affect the accuracy of parameter estimation. We further use a generalized McDonald–Kreitman test to estimate the number of segregating nonneutral sites over time. We find that the inferred evolutionary parameters for both pandemics are time dependent, and that the inferred rates of SARS-CoV-2 and pH1N1 decline by ∼50% and ∼100%, respectively, over the course of 1 year. After at least 4 months since the start of sequence sampling, inferred growth rates and emergence dates remain relatively stable and can be inferred reliably using a logistic growth coalescent model. We show that the time dependency of the mean substitution rate is due to elevated substitution rates at terminal branches which are 2–4 times higher than those of internal branches for both viruses. The elevated rate at terminal branches is strongly correlated with an increasing number of segregating nonneutral sites, demonstrating the role of purifying selection in generating the time dependency of evolutionary parameters during pandemics.

Version published to 10.1093/molbev/msac009
Jan 17, 2022

SciScore for 10.1101/2021.07.27.21261148: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
We downloaded all SARS-CoV-2 sequences from GISAID and pH1N1 influenza sequences from GenBank and align them using MUSCLE v3.8.42528 -- a complete metadata table acknowledging the authors, originating and submitting laboratories of the SARS-CoV-2 sequence data is available in Table S1.	MUSCLE suggested: (MUSCLE, RRID:SCR_011812)
4.1 Phylogenetic analyses: We use BEAST v1.1029 for the Bayesian phylogenetic analysis of the entire dataset using an HKY+G substitution model with a Laplace prior (mean=0 and scale=100) on the coalescent growth rate, a Lognormal prior (mean=1 and stdev=2) on the …

SciScore for 10.1101/2021.07.27.21261148: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
We downloaded all SARS-CoV-2 sequences from GISAID and pH1N1 influenza sequences from GenBank and align them using MUSCLE v3.8.42528 -- a complete metadata table acknowledging the authors, originating and submitting laboratories of the SARS-CoV-2 sequence data is available in Table S1.	MUSCLE suggested: (MUSCLE, RRID:SCR_011812)
4.1 Phylogenetic analyses: We use BEAST v1.1029 for the Bayesian phylogenetic analysis of the entire dataset using an HKY+G substitution model with a Laplace prior (mean=0 and scale=100) on the coalescent growth rate, a Lognormal prior (mean=1 and stdev=2) on the coalescent population size, and a continuous time Markov chain prior on the evolutionary clock rate.	BEAST suggested: (BEAST, RRID:SCR_010228)
We ensure that the effective sample size for every parameter of interest is >200 using Tracer v1.732.	Tracer suggested: (Tracer, RRID:SCR_019121)

Results from OddPub: Thank you for sharing your code and data.

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Results from scite Reference Check: We found no unreliable references.

Read the original source

Version published to 10.1101/2021.07.27.21261148 on medRxiv
Jul 30, 2021

Molecular Evolution of the <i>Fusion</i> (<i>F</i>) Genes in Human Metapneumovirus Genotype B

This article has 10 authors:
1. Tatsuya Shirai
2. Fuminori Mizukoshi
3. Mitsuru Sada
4. Kazuya Shirato
5. Takeshi Saraya
6. Haruyuki Ishii
7. Ryusuke Kimura
8. Toshiyuki Sugai
9. Akihide Ryo
10. Hirokazu Kimura
This article has no evaluationsLatest version Dec 23, 2025
Two years of genomic surveillance capacity development in Guinea: an operational roadmap for local implementation in low-income countries and tracking of SARS-CoV-2 circulation dynamics

This article has 46 authors:
1. Magassouba Magassouba
2. Emanuele Gustani-Buss
3. Kékoura Ifono
4. Emily Victoria Nelson
5. Jacob Camara
6. Annibaldis Giuditta
7. Annick Renevey
8. Julia Hinzmann
9. Mette Hinrichs
10. Sarah Ryter
11. Ehizojie Emua
12. Saa Lucien Millimono
13. Eugene Kolie
14. Moussa Condé
15. Bakary Sylla
16. Nourdine Ibrahim
17. Stephane Mely
18. Hugo Soubrier
19. Joëlle Goüy de Bellocq
20. Beatriz Escudero-Pérez
21. Laura N. Cuypers
22. Elodie Moissonnier
23. Lien De Caluwé
24. Jonas Müller
25. Anke Thielebein
26. Alexandru Tomazatos
27. Christine Jacobsen
28. Meike Pahlmann
29. Beate Becker-Ziaja
30. Cyril Erameh
31. Sylvanus Okogbenin
32. Fara Raymond Koundouno
33. Youssouf Sidibé
34. Kaba Keïta
35. Mamadou Boye Keita
36. Gianluca Loi
37. Moke Fundji Jean Marie Kipela
38. Georges Alfred Ki-Zerbo
39. Seydou Dia
40. Philippe Lemey
41. Stephan Günther
42. Camara
43. Barré Soropogui
44. Liana Eleni Kafetzopoulou
45. Sanaba Boumbaly
46. Sophie Duraffour
This article has no evaluationsLatest version Jan 22, 2026
Emergence and Evolution of Triple Reassortant Highly Pathogenic Avian Influenza A(H5N1) Virus, Argentina, 2025

This article has 15 authors:
1. Estefania Benedetti
2. Maria Carolina Artuso
3. Alexander M. P. Byrne
4. Maria de Belen Garibotto
5. Martín Avaro
6. Luana Erica Piccini
7. Ariana Chamorro
8. Marcelo Sciorra
9. Vanina Daniela Marchione
10. Mara Laura Russo
11. Maria Elena Dattero
12. Erika Macias Machicado
13. Monica Galiano
14. Nicola Lewis
15. Andrea Veronica Pontoriero
This article has no evaluationsLatest version Dec 10, 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Molecular Evolution of the <i>Fusion</i> (<i>F</i>) Genes in Human Metapneumovirus Genotype B

Two years of genomic surveillance capacity development in Guinea: an operational roadmap for local implementation in low-income countries and tracking of SARS-CoV-2 circulation dynamics

Emergence and Evolution of Triple Reassortant Highly Pathogenic Avian Influenza A(H5N1) Virus, Argentina, 2025