Tandem repeat variation shapes immune cell type-specific gene expression
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Tandem repeats (TRs) - highly polymorphic, repetitive sequences across the human genome - are important regulators of gene expression but remain underexplored due to challenges in accurate genotyping and analysis. Here, we generate new whole genome and single-cell RNA sequencing from >5.4 million blood-derived cells across 1,925 individuals in two cohorts, and perform meta-analysis to characterize the impact of variation in >1.7 million TR loci on immune cell type-specific gene expression. We identify >69,000 single-cell expression TR loci (sc-eTRs), 30.7% of which are specific to one of 28 immune cell types, and reveal dynamic regulatory effects using cell-state inference. Matched single-cell ATAC-seq profiles from >3.4 million nuclei in 922 individuals uncover chromatin accessibility QTLs for nearly one-third of expression-associated TRs, supporting coordinated effects on cis-regulatory architecture. Fine-mapping implicates 1,490 TRs as candidate causal drivers of gene expression in 6.1% of tested genes, and colocalization analyses highlight >200 genes in which TRs likely mediate genetic associations with immune and hematological traits. Together, these results provide a genome-wide, multiomic view of TR-mediated regulation in the human immune system, establishing TRs as key contributors to cell type-specific regulatory variation and complex trait architecture.