Tier-based standards for FAIR sequence data and metadata sharing in microbiome research
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Microbiome research is a growing, data-driven field within the life sciences. While policies exist for sharing microbiome sequence data and using standardized metadata schemes, compliance among researchers varies. To promote open research data best practices in the microbiome research community, we (1) propose two tiered badge systems to evaluate data/metadata sharing compliance, and (2) showcase an automated evaluation tool to determine adherence to data reporting standards in publications with amplicon and metagenome sequence data. In a systematic evaluation of publications (n ∼ 3000) spanning human gut microbiome research, we found that nearly half of publications do not meet minimum standards for sequence data availability. Moreover, poor standardization of metadata creates a high barrier to harmonization and cross-study comparison. Using this badge system and evaluation tool, our proof-of-concept work exposes (i) the ineffectiveness of sequence data availability statements, and (ii) the lack of consistent metadata standards for annotating microbial data. In this Perspective, we highlight the need for improved practices and infrastructure that reduce barriers to data submission and maximize reproducibility in microbiome research. We anticipate our tiered badge framework will promote dialogue regarding data sharing practices and facilitate microbiome data reuse, supporting best practices that make microbiome data FAIR.