Genome dynamics across the evolutionary transition to endosymbiosis

This article has been Reviewed by the following groups

Read the full article See related articles

Listed in

Log in to save this article

Abstract

Endosymbiosis – where a microbe lives and replicates within a host – is an important contributor to organismal function that has accelerated evolutionary innovations and catalysed the evolution of complex life. The evolutionary processes associated with transitions to endosymbiosis, however, are poorly understood. Here, we use comparative genomics of the genus Arsenophonus to reveal the complex processes that occur on evolution of an endosymbiotic lifestyle. We compared the genomes of 38 strains spanning diverse lifestyles from environmentally acquired infections to obligate inter-dependent endosymbionts. We observed recent endosymbionts had larger genome sizes than closely related environmentally acquired strains, consistent with evolutionary innovation and rapid gain of new function. Increased genome size was a consequence of prophage and plasmid acquisition including a cargo of type III effectors, and concomitant loss of CRISPR-Cas genome defence systems enabling mobile genetic element expansion. Persistent endosymbiosis was also associated with loss of type VI secretion, likely reflecting reduced microbe-microbe competition. Thereafter, the transition to stable endosymbiosis and vertical inheritance was associated with the expected relaxation of purifying selection, pseudogenisation of genes and reduction of metabolism, leading to genome reduction. However, reduced %GC that is typically considered a progressive linear process was observed only in obligate interdependent endosymbionts. We argue that a combination of the need for rapid horizontal gene transfer-mediated evolutionary innovation together with reduced phage predation in endosymbiotic niches drives loss of genome defence systems and rapid genome expansion upon adoption of endosymbiosis. These remodelling processes precede the reductive evolution traditionally associated with adaptation to endosymbiosis.

Article activity feed

  1. Alignment of protein sequences was performed with mafft using the “-auto” option and back translated to nucleotide alignments using pal2nal [53]. A phylogenetic tree was estimated using the JTTDCMut+F+I+G4 model in IQ-TREE v1.6.12 on the concatenated data of the same set of 188 orthologues protein clusters. This tree was then used to identify genes with significant evidences of relaxation or intensification of selection using the RELAX hypothesis testing framework in HyPhy package [54].

    Given that amino acid sequences were back translated to nucleotide alignments, it would be worth assessing to what extent the tests of relaxed selection may be sensitive to/impacted by alternative translated alignments. That is, each amino acid sequence has multiple alternative nucleotide alignments.

    I'm not certain how pal2nal works exactly, but from what I can tell it appears to generate a single translated alignment. Depending on how it "chooses" which alignment to generate, this could artificially inflate (or deflate) signal of relaxed or positive selection, as it may erode the very signals these tests use to fit model parameters.