Using the simple telegraph model to decipher transcriptional burst regulation across genome-wide data

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Gene transcription is a stochastic bursting process, with burst frequency and burst size as core parameters. Deciphering its genome-wide regulation is challenging, as complex models (capturing detailed biology) are impractical for large-scale use due to unstable inference and high computation. We proposed a simple telegraph model-based framework to analyze genome-wide scRNA-seq and smFISH datasets, addressing these limitations. When sample size ≥ 500 and burst parameter change ≥ 3-fold, its inferred burst frequency- and size-dominated variations reliably proxy true regulation. Analyses across mouse cells (CAST/C57 alleles, fibroblasts vs. embryonic stem cells) and healthy/hypertrophic cardiomyopathy (HCM) human heart tissues revealed key rules: 1) Burst frequency- or size-dominated regulation was primary (over 70% of variable genes), with more bursty-regulated genes in HCM; 2) TATA-initiator synergy (enhancing burst size-dominated regulation) was lost in HCM; 3) Burst frequency-dominated genes enriched in genome stability/cell cycle/apoptosis (via TFs like Foxo4 / Mcm2 ), while burst size-dominated ones in signaling/metabolism/proteostasis (via TFs like Zfp322a / Smchd1 / Ppargc1a ), and their interdependent dysregulation accelerated HCM. This study establishes the simple telegraph model as a scalable framework linking transcriptional burst dynamics to cell fate and pathology.

Article activity feed