Enhancing inference of differential gene expression in metatranscriptomes from human microbial communities

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Metatranscriptomic (MTX) sequencing quantifies gene expression from the collective genomes of microbial communities (microbiomes), enabling assessment of functional activity rather than functional potential. While differential expression testing is instrumental to RNA-sequencing analysis, current metatranscriptomic approaches have been benchmarked only on simulated data and not under real operating conditions, resulting in a lack of standard practices. Here, we evaluate the performance of statistical differential expression methods on both simulated datasets and data collected from ‘mock communities’ (mixtures of bacterial cells of different types in defined ratios) designed for this purpose. We assess the robustness of individual methods to organisms’ low relative abundance, differential abundance, low prevalence, transcription rate changes, and compositional effects, showing that no existing methods perform adequately across all confounding conditions. We then apply the same approaches to metatranscriptomic datasets generated from gnotobiotic mice colonized with defined consortia of human bacterial strains and show that the method nominated by the mock community comparisons successfully inferred cross-feeding dynamics that were subsequently validated in vitro . Finally, using metagenome-assembled genomes from a human clinical study, we leverage genome-level sequencing depth and detection of genes to exclude low information samples on a per-organism basis to overcome confounding low prevalence and enhance differential expression inference. We conclude that MTX method benchmarking on real, not simulated, datasets can and should optimize model implementation, enabling inference and validation of microbial metabolic strategies and interactions from in vivo datasets.

Article activity feed