Using structure prediction of negative sense RNA virus nucleoproteins to assess evolutionary relationships

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Negative sense RNA viruses (NSV) include some of the most detrimental human pathogens, including the influenza, Ebola and measles viruses. NSV genomes consist of one or multiple single-stranded RNA molecules that are encapsidated into one or more ribonucleoprotein (RNP) complexes. Current evolutionary relationships within the NSV phylum are based on alignment of conserved RNA-dependent RNA polymerase (RdRp) domain amino acid sequences. However, the RdRp-based phylogeny does not address whether other core proteins in the NSV genome evolved along the same trajectory. Moreover, the current classification of NSVs does not consistently match the segmented and non-segmented nature of negative-sense virus genomes. Viruses belonging to e.g. the Serpentovirales have a segmented genome but are classified among the non-segmented negative-sense RNA viruses. We hypothesized that RNA genome segmentation is not coupled to the RdRp domain, but rather to the nucleocapsid protein (NP) that forms RNP complexes with the viral RNA. Because NP sequences are too short to infer robust phylogenetic relationships, we here used experimentally-obtained and AlphaFold 2.0-predicted NP structures to probe whether evolutionary relationships can be estimated using NSV NP sequences and potentially improve our understanding of the relationships between NSV subphyla and the NSV genome organization. Following flexible structure alignments of modeled structures, we find that the structural homology of the NSV NPs reveals phylogenetic clusters that are consistent with the currently accepted NSV taxonomy based on RdRp sequences with one key difference: the NPs of the segmented Serpentovirales cluster with the other segmented NSV. In addition, we were able to assign viruses for which RdRp sequences are currently missing to phylogenetic clusters. Overall, our results suggest that the NSV RdRp and NP genes largely evolved along similar trajectories, that NP-based clustering is better correlated with the NSV genome structure organization, and that even short pieces of genetic, protein-coding information can be used to infer evolutionary relationships, potentially making metagenomic analyses more valuable.

Article activity feed