G-SPRI: A Structure-Centric Graph Model for Comprehensive Prediction of Cancer Driver Events from Missense Mutations

Boshen Wang
Ali M. Farhat
Bowei Ye
Jie Liang
Lei Yu
Zeyu Lu
Xinlei Wang
Lin Xu

Abstract

In silico approaches for predicting the functional impact of missense mutations are critical for interpreting personal genomes and identifying disease-related biomarkers. Existing methods largely rely on sequence-based information or intuitive structural features, but often overlook the complex biophysical patterns encoded in protein 3D structures. Here, we present G-SPRI, a multilevel framework built on a novel alpha-shape protein graph that accurately captures residue connectivity from atomic-resolution geometry and enables precise message passing around mutation sites. Using this graph representation, G-SPRI integrates wild-type structural properties and mutation-specific perturbation signals derived from the Protein Data Bank (PDB) universe to support graph-based learning for distinguishing pathogenic from benign missense variants. G-SPRI performs strongly across multiple key tasks. On the binary prediction benchmark, G-SPRI delivers improved pathogenicity prediction for individual mutations. By integrating mutation recurrence across the pan-cancer cohort, G-SPRI recovers more known cancer driver genes than state-of-the-art methods from more than 2.3 million mutations. Furthermore, by jointly quantifying site-specific pathogenicity and co-clustering influence within higher-order structural organization units, G-SPRI provides comprehensive evidence for pinpointing likely driver mutations and structurally susceptible regions within disease genes.

Version published to 10.64898/2026.05.06.723398 on bioRxiv
May 11, 2026

Related articles

LNGCN: A Distance-Aware Dynamics Network for Protein-Protein Interaction Prediction

This article has 16 authors:
1. Yueming Xiao
2. Yifan Zheng
3. Yu Hua
4. Jiahua Peng
5. Jinliang Liu
6. Yuan Qu
7. Jizhuang Xu
8. Rao Fu
9. Qiuting Qian
10. Make Zhao
11. Xinxin Zhang
12. Jingjing Zhao
13. Yifei Yao
14. Martin Kosar
15. Yuehai Ke
16. Ying Chi
This article has no evaluations
Latest version May 4, 2026

eSIG-Net: Accurate prediction of single-mutation induced perturbations on protein interactions using a language model

This article has 17 authors:
1. Xingxin Pan
2. Aditya Shrawat
3. Sidharth Raghavan
4. Chuanpeng Dong
5. Yuntao Yang
6. Zhao Li
7. W. Jim Zheng
8. S. Gail Eckhardt
9. Erxi Wu
10. Juan I. Fuxman Bass
11. Daniel F. Jarosz
12. Sidi Chen
13. Daniel J. McGrail
14. Gloria M. Sheynkman
15. Jason H. Huang
16. Nidhi Sahni
17. S. Stephen Yi
This article has no evaluations
Latest version Mar 31, 2026

EVEE: Interpretable variant effect prediction from genomic foundation model embeddings

This article has 22 authors:
1. Michael T. Pearce
2. Thomas Dooms
3. Ryo Yamamoto
4. Joshua Meehl
5. Carl Molnar
6. Mark Bissell
7. Dron Hazra
8. Ching Fang
9. Nam Nguyen
10. Michael Anderson
11. Collin Osborne
12. Patrick Duffy
13. Bridget Toomey
14. Eric Klee
15. Elena Myasoedova
16. Alexander J. Ryu
17. Shant Ayanian
18. Panos Korfiatis
19. Matt Redlon
20. Archa Jain
21. Daniel Balsam
22. Nicholas K. Wang
This article has no evaluations
Latest version Apr 11, 2026
