Phylogenetic discordance can substantially overestimate genomic reassortment in avian influenza virus

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Recombination plays an important role in the evolution of RNA viruses, as it allows the exchange of genetic material between viral lineages. Reassortment, a form of recombination specific to segmented genomes, involves the exchange of entire segments and has contributed significantly to the adaptation and spread of influenza viruses through novel genomic combinations, i.e ., antigenic shifts. It is usually identified by phylogenetic discordance: differences in the topologies of trees reconstructed from different genomic segments. However, phylogenetic discordance can also result from error in reconstructing trees. To characterize the impact of reconstruction error, we curated a database of n = 11, 765 complete genomes of avian H5Nx influenza A viruses from avian hosts. We found evidence of widespread reassortment as measured by inferred subtree-prune-regraft (SPR) events, consistent with previous studies. Next, we ran replicate simulations of sequence evolution along the reference tree for the segment encoding hemagglutinin (HA), adjusting simulations for the lengths and clock rates of the other segments. These simulations provided a baseline for the expected amount of phylogenetic discordance in the absence of any reassortment. When sampling HA sequences at random from the database to build reference trees, we observed that simulating other segments without reassortment still yielded about 32% as many SPRs as the real segment data on average. The average proportion of SPRs without reassortment was greatly reduced (4%) if we selected an equivalent number of HA sequences retaining the most genetic diversity, which was consistent with the accuracy of phylogenetic reconstruction being the limiting factor. This implies that measuring reassortment by SPRs may have a high false positive rate, and that previous evidence of extensive reassortment in influenza viruses should be interpreted with caution. In addition, we observed that the SPRs reconstructed on simulated trees had significantly shorter distances between the prune and regraft locations than real trees. These results suggest that down-sampling sequences to maximize evolutionary divergence and filtering out the shortest SPRs may be effective measures against false positives.

Article activity feed