Comprehensive analysis of insertion sequences within rRNA genes of CPR bacteria and biochemical characterization of a homing endonuclease encoded by these sequences

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The Candidate Phyla Radiation (CPR) represents an extensive bacterial clade comprising primarily uncultured lineages and is distinguished from other bacteria by a significant prevalence of insertion sequences (ISs) within their rRNA genes. However, our understanding of the taxonomic distribution and characteristics of these ISs remains limited. In this study, we used a comprehensive approach to systematically determine the nature of the rRNA ISs in CPR bacteria. The analysis of hundreds of rRNA gene sequences across 65 CPR phyla revealed that ISs are present in 48% of 16S rRNA genes and 82% of 23S rRNA genes, indicating a broad distribution across the CPR clade, with exceptions in the 16S and 23S rRNA genes of Saccharibacteria and the 16S rRNA genes of Peregrinibacteria. Over half the ISs display a group-I-intron-like structure, whereas specific 16S rRNA gene ISs display features reminiscent of group II introns. The ISs frequently encode proteins with homing endonuclease (HE) domains, centered around the LAGLIDADG motif. The LAGLIDADG HE (LHE) proteins encoded by the rRNA ISs of CPR bacteria predominantly have a single-domain structure, deviating from the usual single- or double-domain configuration observed in typical prokaryotic LHEs. Experimental analysis of one LHE protein, I-ShaI from Ca . Shapirobacteria, confirmed that its endonuclease activity targets the DNA sequence of its insertion site, and chemical cross-linking experiments demonstrated its capacity to form homodimers. These results provide robust evidence supporting the hypothesis that the explosive proliferation of rRNA ISs in CPR bacteria was facilitated by mechanisms involving LHEs.

IMPORTANCE

Insertion sequences (ISs) in rRNA genes are relatively limited and infrequent in standard bacteria. With a comprehensive bioinformatic analysis, we show that in CPR bacteria, which are characterized by a high frequency of ISs, these ISs occur in 48% of 16S rRNA genes and 82% of 23S rRNA genes. We also report the systematic and biochemical characterization of the LAGLIDADG homing endonucleases (LHEs) encoded by these ISs in the first such analysis of the CPR bacteria. This study significantly extends our understanding of the phylogenetic positions of rRNA ISs within CPR bacteria and the biochemical features of their LHEs.

Article activity feed