Orchestrating Microbiome Analysis with Bioconductor
Abstract
The expansion of microbiome research has led to the accumulation of interlinked datasets encompassing versatile taxonomic and functional assays. While such datasets are critical to advancing the field, the analysis of increasingly large and heterogeneous multi-modal microbiome data would benefit from unified approaches that support the design of modular data science workflows through interoperable methods. The Bioconductor project has recently developed an optimized statistical programming framework for multi-assay data integration. Building on this foundation, we introduce a community-developed open source ecosystem for microbiome data science. In contrast to previous alternatives, the methodology is specifically designed to support joint analysis of hierarchical, interlinked, and heterogeneous multi-table datasets that are increasingly common in modern microbiome research. This data science ecosystem encompasses open data, methods, tutorials, and an active online community. These resources support standardized and reproducible data wrangling, joint analysis, and reporting. We detail the functionality and usage in the online book (https://microbiome.github.io/OMA/docs/devel), which offers guidance for prospective users and contributors.