Mapping active cis-regulatory elements from transcription initiation events
Abstract
Determining the activity of cis-regulatory elements (CREs) is essential for modeling gene regulation and interpreting genetic variation. Yet, current methods often lack the specificity to distinguish active regulation from permissive chromatin, the sensitivity to detect unstable enhancer RNAs, or the scalability required to profile limited input material and primary cells. Here, we introduce nucCAGE, a transcription start site (TSS) assay for profiling nuclear, capped RNAs, and PRIME, a computational framework for identifying active CREs from TSS data. Together, these methods increase sensitivity to low-abundance RNAs and enable robust detection of active regulatory elements across diverse contexts. Across multiple orthogonal functional and genetic benchmarks, including fine-mapped eQTLs, ClinVar variants, GWAS loci, and CRISPRi-tested elements, nucCAGE-derived PRIME predictions achieve superior recall compared to state-of-the-art methods while maintaining strong enrichment for phenotype-associated variation. Applying PRIME to the FANTOM5 dataset yields a comprehensive, cell-type-resolved atlas of active CREs that recapitulates known tissue-trait relationships. We demonstrate how this atlas can be used to nominate causal noncoding variants, linking immune-cell enhancer regulation of SMAD3 to asthma and NCOR2 to premature separation of placenta. Together, nucCAGE and PRIME provide a framework for high-sensitivity genome-wide discovery of active CREs and a resource for variant-to-function studies.