Consistency across multi‐omics layers in a drug‐perturbed gut microbial community
This article has been Reviewed by the following groups
Listed in
- Evaluated articles (Arcadia Science)
Abstract
Multi‐omics analyses are used in microbiome studies to understand molecular changes in microbial communities exposed to different conditions. However, it is not always clear how much each omics data type contributes to our understanding and whether they are concordant with each other. Here, we map the molecular response of a synthetic community of 32 human gut bacteria to three non‐antibiotic drugs by using five omics layers (16S rRNA gene profiling, metagenomics, metatranscriptomics, metaproteomics and metabolomics). We find that all the omics methods with species resolution are highly consistent in estimating relative species abundances. Furthermore, different omics methods complement each other for capturing functional changes. For example, while nearly all the omics data types captured that the antipsychotic drug chlorpromazine selectively inhibits Bacteroidota representatives in the community, the metatranscriptome and metaproteome suggested that the drug induces stress responses related to protein quality control. Metabolomics revealed a decrease in oligosaccharide uptake, likely caused by Bacteroidota depletion. Our study highlights how multi‐omics datasets can be utilized to reveal complex molecular responses to external perturbations in microbial communities.
Article activity feed
-
-
a baseline for integrating different data types in “in natura” settings.
I'm not sure this is entirely true since this study is from a synthetic community of species that are specifically picked and also have no strain diversity, so the dynamics are limited to species composition changes mostly and not dynamics of similar strains, or other factors such as phages, microbial eukaryotes, etc. The largest "natural" multi-omics studies I could think of that integrated different types of data were Woodcroft et al. from permafrost https://www.nature.com/articles/s41586-018-0338-1 and Herald et al. from wastewater: https://www.nature.com/articles/s41467-020-19006-2 (which includes metabolomics)
-
we find that all omics methods with species resolution in their readouts are highly consistent in estimating relative species abundances across conditions.
From the outset I find this quite surprising because there are a couple examples for low complexity enrichment communities or complex naturally occurring communities where this isn't the case that metagenomic/metatranscriptomic/metaproteomic data are concurrent with each other with what species are the most abundant corresponding to most "active" - is this because the synthetic community members were spiked in at the same abundance or the nature of these select synthetic communities exhibit "stable" behavior over time? I think some of these results are only applicable to synthetic communities for different multi-omics measurements being consistent with one another because of the …
we find that all omics methods with species resolution in their readouts are highly consistent in estimating relative species abundances across conditions.
From the outset I find this quite surprising because there are a couple examples for low complexity enrichment communities or complex naturally occurring communities where this isn't the case that metagenomic/metatranscriptomic/metaproteomic data are concurrent with each other with what species are the most abundant corresponding to most "active" - is this because the synthetic community members were spiked in at the same abundance or the nature of these select synthetic communities exhibit "stable" behavior over time? I think some of these results are only applicable to synthetic communities for different multi-omics measurements being consistent with one another because of the inherent makeup of the community
-
For metaproteomics data, we estimated species abundance
I don't think this is species abundance but rather the relative activity of that species based on protein intensity. I also think there are difficulties with calculating species "abundance" in this way because if I remember correctly you will sometimes have a redundant peptide (or a "core genome" protein) that can't be attributed back to a certain genome and those have to be tossed out, so this calculation is based on protein intensities for unique peptides attributed back to a particular genome, correct?
-
Our results show generally high consistency between omics data types in relative species abundance estimations, and underline that metaproteomics can, in principle, provide robust species abundance estimates, at least for synthetic microbial communities, albeit with lower sensitivity.
I think I might be confused by the overall framing - is the take home that these different methods should be consistent so that you could use any one alone for surveying a community and know it should give you the same information? Or that if they don't coincide which method is the most reliable for the information that is needed? Some of these conclusions for these methods seem to only be applicable to stable, simple synthetic communities
-
a baseline for integrating different data types in “in natura” settings.
I'm not sure this is entirely true since this study is from a synthetic community of species that are specifically picked and also have no strain diversity, so the dynamics are limited to species composition changes mostly and not dynamics of similar strains, or other factors such as phages, microbial eukaryotes, etc. The largest "natural" multi-omics studies I could think of that integrated different types of data were Woodcroft et al. from permafrost https://www.nature.com/articles/s41586-018-0338-1 and Herald et al. from wastewater: https://www.nature.com/articles/s41467-020-19006-2 (which includes metabolomics)
-
Our results show generally high consistency between omics data types in relative species abundance estimations, and underline that metaproteomics can, in principle, provide robust species abundance estimates, at least for synthetic microbial communities, albeit with lower sensitivity.
I think I might be confused by the overall framing - is the take home that these different methods should be consistent so that you could use any one alone for surveying a community and know it should give you the same information? Or that if they don't coincide which method is the most reliable for the information that is needed? Some of these conclusions for these methods seem to only be applicable to stable, simple synthetic communities
-
For metaproteomics data, we estimated species abundance
I don't think this is species abundance but rather the relative activity of that species based on protein intensity. I also think there are difficulties with calculating species "abundance" in this way because if I remember correctly you will sometimes have a redundant peptide (or a "core genome" protein) that can't be attributed back to a certain genome and those have to be tossed out, so this calculation is based on protein intensities for unique peptides attributed back to a particular genome, correct?
-
we find that all omics methods with species resolution in their readouts are highly consistent in estimating relative species abundances across conditions.
From the outset I find this quite surprising because there are a couple examples for low complexity enrichment communities or complex naturally occurring communities where this isn't the case that metagenomic/metatranscriptomic/metaproteomic data are concurrent with each other with what species are the most abundant corresponding to most "active" - is this because the synthetic community members were spiked in at the same abundance or the nature of these select synthetic communities exhibit "stable" behavior over time? I think some of these results are only applicable to synthetic communities for different multi-omics measurements being consistent with one another because of the …
we find that all omics methods with species resolution in their readouts are highly consistent in estimating relative species abundances across conditions.
From the outset I find this quite surprising because there are a couple examples for low complexity enrichment communities or complex naturally occurring communities where this isn't the case that metagenomic/metatranscriptomic/metaproteomic data are concurrent with each other with what species are the most abundant corresponding to most "active" - is this because the synthetic community members were spiked in at the same abundance or the nature of these select synthetic communities exhibit "stable" behavior over time? I think some of these results are only applicable to synthetic communities for different multi-omics measurements being consistent with one another because of the inherent makeup of the community
-