Target-enriched sequencing enables genomic characterization within diverse microbial populations – a preprint

Enrique Doster
Lee J. Pinnell
Cory A. Wolfe
Noelle R. Noyes
Robert Valeris-Chacin
William B. Crosby
Michael L. Clawson
Amelia R. Woolums
Paul S. Morley

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Characterizing microbial genetic sequences and key variants is critical for understanding pathogen ecology, transmission, and clinical impact. Yet, conventional metagenomic sequencing often yields too few on-target reads to move beyond species-level identification. We developed a target-enriched (TE) metagenomic workflow, including bait design, an optimized TE shotgun protocol, and the VARIANT++ pipeline, to recover and classify reads at a clustered genomic sequence-variant (GSV) level (see Graphical abstract). The computational component clusters reference genomes by average nucleotide identity, builds a GSV database, and integrates Kraken2, Themisto, and mSWEEP to increase call confidence while reducing false positives. Using Mannheimia haemolytica ( Mh ), the primary cause of bovine respiratory disease, we designed 114,375 DNA baits targeting sequences across 70 reference genomes. TE libraries from nasopharyngeal swabs of feedlot cattle achieved >250-fold increases in on-target Mh reads (∼2.5% of non-host reads on average) compared with conventional shotgun sequencing, despite using one-quarter the sequencing depth. This variant-level resolution revealed six GSVs; most samples contained at least two, indicating variant mixtures difficult to detect with culture- or shotgun-based surveys. Because the approach leverages available reference sequences, it can be reconfigured for other microbial targets. TE metagenomics paired with genome-similarity clustering provides a scalable approach to variant-level characterization from complex microbial populations.

Abstract Figure

Graphical abstract

Overview of the components in our three-part workflow.

Version published to 10.1101/2025.10.23.684174 on bioRxiv
Oct 24, 2025

Shotgun metagenomics: a deep insight into the composition and function of the complex microbial world

This article has 7 authors:
1. Grazia Visci
2. Elisabetta Notario
3. Giuseppe Defazio
4. Mariano Francesco Caratozzolo
5. Bruno Fosso
6. Marinella Marzano
7. Graziano Pesole
This article has no evaluationsLatest version Jan 30, 2026
One Health Viral Metagenomics for Pandemic Preparedness: Validated mNGS Workflows for Viral Detection and Genome Recovery from Swab and Tissue Specimens

This article has 14 authors:
1. Tristan Russell
2. Elisa Formiconi
3. Alison Murphy
4. Jimmy Hortion
5. Máire McElroy
6. Mícheál Casey
7. Laura Garza Cuartero
8. John F Mee
9. Hanne Jahns
10. Christine Kelly
11. Joanne Byrne
12. Eoin R Feeney
13. Patrick WG Mallon
14. Virginie W Gautier
This article has no evaluationsLatest version Jan 16, 2026
Quantitative evaluation of microbiome sequencing resolution under varying experimental conditions using defined mock communities

This article has 5 authors:
1. Songhee Lee
2. Hyeonah Lee
3. Jung Wook Kim
4. Hyeon-Jin Kim
5. Kwang Jun Lee
This article has no evaluationsLatest version Dec 30, 2025

Discuss this preprint

Listed in

Abstract

Abstract Figure

Article activity feed

Related articles

Shotgun metagenomics: a deep insight into the composition and function of the complex microbial world

One Health Viral Metagenomics for Pandemic Preparedness: Validated mNGS Workflows for Viral Detection and Genome Recovery from Swab and Tissue Specimens

Quantitative evaluation of microbiome sequencing resolution under varying experimental conditions using defined mock communities