A public resource of 15 genomically characterised representative strains of Shigella sonnei

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Shigella sonnei is rapidly emerging as the dominant agent of shigellosis, an enteric disease responsible for a significant burden of morbidity and mortality worldwide. Whole-genome sequencing of S. sonnei isolated over the last three decades has revealed phylogenomic diversity within the population and the emergence of multiple lineages associated with distinct epidemiological patterns such as resistance to critical antimicrobials and/or transmission within different groups. However most experimental work on S. sonnei biology and pathogenicity has focused on a single laboratory strain (53G), which is phylogenetically distant from currently circulating strains. Here we introduce a set of 15 phylogenetically diverse and epidemiologically relevant S. sonnei isolates made available through publicly accessible culture collections as a resource for laboratory science. We present their complete whole-genome sequences, including the pINV invasion plasmid (missing from a large proportion of public genome data due to loss during laboratory culture). Finally, the characterisation and comparison of these complete genome sequences highlight evidence for ongoing adaptive evolution in S. sonnei , featuring the accumulation of insertion sequences, gene pseudogenisation and structural variation.

Significance as a BioResource to the community

Genomic analysis of Shigella has historically been challenging due to presence of hundreds of repetitive sequence elements (which can cause fragmented assemblies) and loss of the pINV invasion plasmid (essential to virulence) during laboratory culture. Furthermore, most experimental work on S. sonnei pathogenicity uses a lab strain that is phylogenetically distant from circulating isolates. To support S. sonnei experimental and in silico research and increase its relevance to current clinical problems, we report here the complete, high-quality genome sequences of 15 S. sonnei isolates, each selected to represent distinct sub-clades of epidemiological interest. We also make the corresponding strains publicly available in national reference culture collections.

Data summary

All sequencing reads and complete assemblies have been deposited into the National Center for Biotechnology Information (NCBI) database (accessions to be determined). Genome assemblies and Bakta annotations used in the analysis can be found in Figshare ( https://doi.org/10.6084/m9.figshare.28302986 ) together with Mauve multiple-sequence alignments for the chromosome and pINV plasmid sequences, and genome-scale metabolic models produced for each strain.

Pure cultures of all strains were deposited in the publicly accessible National Collection of Type Cultures (NCTC, UK) or the “ Collection de l’Institut Pasteur ” (CIP, France) (accessions to be determined).

Article activity feed