metaboprep v2: Broadening the application of the metaboprep beyond metabolomics
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
High-throughput multiplex assays for metabolomics and proteomics offer opportunities for biomarker discovery and disease stratification in epidemiological research. The complexity of these datasets requires robust, standardized and transparent preprocessing workflows to ensure reproducibility and comparability across studies. We present an updated and enhanced version of the metaboprep R package. Originally designed for metabolomics data, it has now been extended to support proteomics datasets from platforms such as Olink® and SomaScan®. This release introduces a user-friendly, modular, object-oriented architecture using R’s S7 system, enabling improved input format flexibility, streamlined report generation and increased compatibility with other third-party tools. The updated pipeline is structured in three parts: data import, filtering and summary, and output generation. This structure provides a reproducible yet customizable framework for pre-analysis data preparation with utility across multiple omics platforms and particular value in supporting multi-cohort epidemiological research.
Availability and implementation
The metaboprep package is implemented in R and freely available at: https://github.com/MRCIEU/metaboprep .