Covariant Fitness Clusters Reveal Structural Evolution of SARS-CoV-2 Polymerase Across the Human Population

This article has been Reviewed by the following groups

Read the full article

Abstract

Understanding the fitness landscape of viral mutations is crucial for uncovering the evolutionary mechanisms contributing to pandemic behavior. Here, we apply a Gaussian process regression (GPR) based machine learning approach that generates spatial covariance (SCV) relationships to construct stability fitness landscapes for the RNA-dependent RNA polymerase (RdRp) of SARS- CoV-2. GPR generated fitness scores capture on a residue-by-residue basis a covariant fitness cluster centered at the C487-H642-C645-C646 Zn 2+ binding motif that iteratively evolves since the early phase pandemic. In the Alpha and Delta variant of concern (VOC), multi-residue SCV interactions in the NiRAN domain form a second fitness cluster contributing to spread. Strikingly, a novel third fitness cluster harboring a Delta VOC basal mutation G671S augments RdRp structural plasticity to potentially promote rapid spread through viral load. GPR principled SCV provides a generalizable tool to mechanistically understand evolution of viral genomes at atomic resolution contributing to fitness at the pathogen-host interface.

Article activity feed

  1. SciScore for 10.1101/2022.01.07.475295: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    Foldx has been widely used for assessing structural stability impacted by genetic mutations (102–106).
    Foldx
    suggested: (FoldX, RRID:SCR_008522)
    The protein mutations S647I and L638F were constructed using the WT structure and mutating the residues S647 and L638, respectively, with CHARMM to replace the atomic coordinates of the side chains.
    CHARMM
    suggested: (CHARMM, RRID:SCR_014892)
    The structural presentations were produced by the software of PyMOL.
    PyMOL
    suggested: (PyMOL, RRID:SCR_000305)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: Please consider improving the rainbow (“jet”) colormap(s) used on pages 42, 44, 46, 48, 54 and 61. At least one figure is not accessible to readers with colorblindness and/or is not true to the data, i.e. not perceptually uniform.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.