GRNITE: Gene Regulatory Network Inference with Text Embeddings

Ali Azizpour
Narein Rao
Santiago Segarra
Luay Nakhleh
Nicolae Sapoval

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Gene regulatory networks (GRNs) capture complex regulatory relationships that govern gene expression in cells. Inference of GRNs from single-cell RNA-seq (scRNA-seq) data has been an active topic of research in the past several years. However, despite the improvements in the data quality, the GRN inference problem remains a challenging task with many approaches showing variable performance dependent on the organism and cell type. To improve the quality of GRN inference and enable more comprehensive exploratory analyses of GRNs across various phenotypes of interest, we developed a two-stage meta-method called GRNITE. In the first step, GRNITE leverages LLM-based embeddings of plain text gene descriptions to create a prior gene interaction graph, which is then optimized with a graph neural network (GNN) to achieve a “universal” biological prior for GRN inference. In the second step, GRNITE uses a GNN to incorporate information from a GRN inferred from scRNA-seq data with any baseline inference method into our prior. The result of this two-step approach is a near-universal improvement in AUROC and recall of all evaluated methods, with minor trade-offs in precision. Furthermore, GRNITE is a lightweight meta-method, which adds a minimal amount of extra compute time on top of the original GRN inference performed. GRNITE and our pre-trained universal prior GRN are available on GitHub: https://github.com/aliaaz99/GRNITE .

Version published to 10.1101/2025.11.25.690454 on bioRxiv
Nov 28, 2025

DAG-VAERL: a novel causal inference method for building causal gene regulatory network

This article has 3 authors:
1. Teng Long
2. Sachit Satyal
3. Jean Gao
This article has no evaluationsLatest version Jan 22, 2026
Uncovering miRNA–Disease Associations Through Graph Based Neural Network Representations

This article has 1 author:
1. Alessandro Orro
This article has no evaluationsLatest version Jan 28, 2026
In-Context Learning in Genomic Language Models as a Biological Evaluation Task

This article has 2 authors:
1. Aadit Kapoor
2. Wendy Lee
This article has no evaluationsLatest version Dec 9, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

DAG-VAERL: a novel causal inference method for building causal gene regulatory network

Uncovering miRNA–Disease Associations Through Graph Based Neural Network Representations

In-Context Learning in Genomic Language Models as a Biological Evaluation Task