A genomic atlas of the human gut virome elucidates genetic factors shaping host interactions
Abstract
Viruses are key modulators of human gut microbiome composition and function. While metagenomic sequencing has enabled culture-independent discovery of gut bacteriophage diversity, existing genomic catalogues suffer from limited geographic representation, sparse taxonomic classification, and insufficient functional annotation, hindering detailed investigation into phage biology. Here, we present the Unified Human Gastrointestinal Virome (UHGV), a collection of 873,994 viral genomes from globally diverse populations that addresses these limitations. UHGV provides high-quality virome references with extensive host predictions, comprehensive functional annotations, protein structures, a classification framework for comparative analysis, and a web portal to facilitate data access. Using UHGV to profile worldwide metagenomes, we found that host range breadth is strongly associated with phage prevalence. Additionally, we identified diversity-generating retroelements and DNA methyltransferases as key factors enabling phage populations to access diverse hosts, revealing how specific genomic features contribute to global phage distribution patterns. UHGV is available at http://uhgv.jgi.doe.gov.