OncoReasoner: An Interpretable Regulatory Network Inference Framework for HPV E6/E7-Induced Transcriptomic Perturbations Leveraging Large Language Models

Youssef Ahmedm
Ruotong Luan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Human papillomavirus (HPV) E6/E7 oncoproteins perturb host gene regulatory networks, driving oncogenesis. Existing computational methods often struggle to provide interpretable, chain-of-thought mechanistic explanations for observed transcriptomic changes. To address this, we introduce OncoReasoner, a novel framework that integrates biological expression analysis with the advanced reasoning capabilities of large language models (LLMs) and graph neural networks (GNNs). OncoReasoner comprises an Expression Encoder for rich gene embeddings, a Bio-LLM Reasoning Module for context-aware mechanistic explanations, and a Graph Refinement Module leveraging GNNs and prior knowledge for network consistency. Evaluated on diverse datasets, including GEO and TCGA, our framework significantly outperforms traditional statistical methods, GNNs, and other LLM baselines across differential gene expression classification, regulatory network edge prediction, and particularly, functional pathway reasoning. OncoReasoner notably achieves high accuracy in pathway identification and receives excellent expert ratings for its mechanistic explanations, demonstrating its superior ability to provide deep, accurate, and highly interpretable biological insights. An ablation study confirms the critical contribution of each module, and human evaluation further validates the qualitative excellence of its mechanistic explanations, marking a substantial advancement in explainable AI for cancer research.

Version published to 10.20944/preprints202603.0191.v1
Mar 4, 2026

Integrated transcriptomic and machine learning-driven analysis reveals high-confidence circular RNA biomarkers in Lung Adenocarcinoma

This article has 2 authors:
1. Ayushi Malviya
2. Rajabrata Bhuyan
This article has no evaluationsLatest version Feb 19, 2026
DeepCas12a: A hybrid deep learning framework for accurate Cas12a efficiency prediction from sequence and epigenetic information

This article has 6 authors:
1. Yiming Shi
2. Junkai Yin
3. Shurui Ning
4. Jinling Yuan
5. Degang Yang
6. Guohui Chuai
This article has no evaluationsLatest version Feb 9, 2026
Systems-Level Transcriptomic Integration Reveals a Core Metaflammatory Network Linking Type 2 Diabetes and HBV Infection to Cholangiocarcinoma Progression

This article has 12 authors:
1. Hasan Md Rasadul
2. Shihui Ma
3. Ziqiang Ge
4. Rahman Md Zahidur
5. Pengcheng Kang
6. Junqi You
7. Jinglin Li
8. Chenghong Duan
9. Siddique A. Z. M. Fahim
10. Mozumder Somrat Akbor
11. Xudong Zhao
12. Yunfu Cui
This article has no evaluationsLatest version Mar 12, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Integrated transcriptomic and machine learning-driven analysis reveals high-confidence circular RNA biomarkers in Lung Adenocarcinoma

DeepCas12a: A hybrid deep learning framework for accurate Cas12a efficiency prediction from sequence and epigenetic information

Systems-Level Transcriptomic Integration Reveals a Core Metaflammatory Network Linking Type 2 Diabetes and HBV Infection to Cholangiocarcinoma Progression