SARS-CoV-2 NSP3, NSP4 and NSP6 mutations and Epistasis during the pandemic in the world: Evolutionary Trends and Natural Selections in Six Continents

This article has been Reviewed by the following groups

Read the full article

Abstract

Background

The Coronavirus 2019 (COVID-19) was named by the World Health Organization (WHO) due to its rapid transmittable potential and high mortality rate. Based on the critical role of None Structural Proteins (NSP), NSP3, NSP4, and NSP6 in COVID-19, this study attempts to investigate the superior natural selection mutations and Epistasis among these none structural proteins.

Methods

Approximately 6.5 million SARS-CoV-2 protein sequences of each NSP3, NSP4, and NSP6 nonstructural protein were analyzed from January 2020 to January 2022. Python programming language was utilized to preprocess and apply inclusion criteria on the FASTA file to prepare a list of suitable samples. NSP3, NSP4, and NSP6 were aligned to the reference sequence to compare and identify mutation patterns categorized based on frequency, geographical zone distribution, and date. To discover epistasis situations, linear regression between mutation frequency and date among candidate genes was performed to determine correlations.

Results

The rate of NSP3, NSP4, and NSP6 mutations in divided geographical areas was different. Based on continental studies, P1228L (54.48%), P1469S (54.41%), and A488S (53.86%) mutations in NSP3, T492I (54.84%), and V167L (52.81%) in NSP4 and T77A (69.85%) mutation in NSP6 increased over time, especially in recent months. For NSP3, Europe had the highest P1228L, P1469S, and A488S mutations. For NSP4, Oceania had the highest T492I and V167L mutations, and for NSP6, Europe had the highest T77A mutation. Hot spot regions for NSP3, NSP4, and NSP6 were 1358 to 1552 AA, 150 to 200 AA, and 58 to 87 AA, respectively. Our results showed a significant correlation and co-occurrence between NSP3, NSP4, and NSP6 mutations.

Conclusion

We conclude that the effect of mutations on virus stability and replication can be predicted by examining the amino acid changes of P1228L, P1469S, A488S, T492I, V167L and T77A mutations. Also, these mutations can possibly be effective on the function of proteins and their targets in the host cell.

Article activity feed

  1. SciScore for 10.1101/2022.05.22.22275422: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    2.2 Sequence analyses and processing: Python 3.8.0 language programming and ′Numpy′ and ′Pandas′ libraries were used in this research to preprocess FASTA files, extract NSP3, NSP4, and NSP6 from other genes, and perform sequence alignment.
    Python
    suggested: (IPython, RRID:SCR_001658)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a protocol registration statement.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.