Towards universal modeling of transcript isoform expression levels

Savio Ho-Chit Chow
Christina Huan Shi
Aniruddha Deshpande
Qin Cao
Kevin Y. Yip

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

A holy grail in computational biology is accurate modeling of transcript expression levels using epigenetic features, which would provide a quantitative way to study gene regulation in normal and disease states. Previous studies relied heavily on immortalized cell lines that exhibit properties different from cells in natural tissue environments. Most studies also quantified the expression of each gene by a single expression level, which fails to capture separate expression levels of different transcript isoforms of the same gene. In this study, making use of the latest large-scale dataset of paired transcriptomic and epigenomic data of human samples produced by the International Human Epigenome Consortium (IHEC), we computationally modeled the expression levels of individual transcript isoforms in 324 samples from 29 tissue types. We constructed the models using graph-based methods that integrate both location-specific epigenomic features and multiple types of gene-gene relationships. We found that to infer transcript isoform expression levels in a sample, a model that integrates information from many samples of other tissue types consistently outperforms a model trained on data from this sample itself, providing strong support that it is possible to construct a “universal” model that can accurately infer transcript isoform expression levels across tissue types.

Version published to 10.1101/2025.07.21.665977 on bioRxiv
Jul 25, 2025

Cell-type-specific transcriptomic signatures associated with Alzheimer’s disease in the ROSMAP cohort: a single-nucleus RNA-seq pseudobulk analysis.

This article has 1 author:
1. Jose Israel Nadal Vidal
This article has no evaluationsLatest version Jan 6, 2026
An integrated single-cell transcriptomic dataset for Mouse cortex

This article has 8 authors:
1. Xuefeng Shi
2. Zhihui Qi
3. Hong Huang
4. Zhiming Ye
5. YuMin Wu
6. Kahei Chan
7. Maojin Yao
8. Zhongxing Wang
This article has no evaluationsLatest version Dec 18, 2025
Comparative transcriptomic analysis defines shared and mammary-specific gene expression programs across glandular tissues

This article has 2 authors:
1. Marie Saitou
2. Guro Katrine Sandvik
This article has no evaluationsLatest version Dec 29, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Cell-type-specific transcriptomic signatures associated with Alzheimer’s disease in the ROSMAP cohort: a single-nucleus RNA-seq pseudobulk analysis.

An integrated single-cell transcriptomic dataset for Mouse cortex

Comparative transcriptomic analysis defines shared and mammary-specific gene expression programs across glandular tissues