Machine Learning-Based Assessment of the Healthy Human Gut Mycobiota Landscape Using ITS1 DNA Metabarcoding Data
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The human gut microbiome plays a critical role in maintaining host health and homeostasis, and current literature suggests a bidirectional relationship between microbiome ecology and host well-being. DNA metabarcoding has emerged as a powerful tool for investigating microbiome imbalances (i.e., dysbiosis). While the prokaryotic microbiome has been extensively studied, the fungal counterpart – or mycobiome – remains largely unexplored, despite its recognized role from the perinatal stage onward. Here, we present a comprehensive survey based on DNA metabarcoding analysis of approximately 1,500 publicly available ITS1 samples. This survey integrates conventional statistical approaches with Machine Learning (ML) methods coupled with explainable Artificial Intelligence (XAI). ML models successfully predicted host health status with accuracies exceeding 80%, and fungal genera such as Eurotium , Aureobasidium , Candida , and Cutaneotrichosporon emerged as key classification features. This study introduces a cutting-edge multiview analytical framework applied to publicly available mycobiome data, highlighting the potential of fungal community profiling as a non-invasive tool to support health diagnostics.