ProtLoc-GRPO: Cell line-specific subcellular localization prediction using a graph-based model and reinforcement learning

Shuai Zeng
Weinan Zhang
Chaohan Li
Yuexu Jiang
Duolin Wang
Qing Shao
Dong Xu

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Subcellular localization prediction is crucial for understanding protein functions and cellular processes. Subcellular localization is dependent on tissue and cell lines derived from different cell types. Predicting cell line-specific subcellular localization using the information of protein-protein interactions (PPIs) offers deeper insights into dynamic cellular organization and molecular mechanisms. However, many existing PPI networks contain systematic errors that limit prediction accuracy. In this study, we propose a reinforcement learning approach, ProtLoc-GRPO, to enhance subcellular localization prediction by optimizing the structure of the underlying PPI network. ProtLoc-GRPO learns to rank and retain the most informative PPI edges to maximize the macro-F1 score for cell line-specific subcellular localization. Our approach yields a 7% improvement in macro-F1 score over the baseline. We further evaluate its robustness across various edge pruning rates and benchmark it against conventional pruning strategies. Results show that our proposed method consistently outperforms existing approaches. To our knowledge, this work represents the first study to predict cell line-specific protein subcellular localization and the first application of the Group Relative Policy Optimization (GRPO) framework to a graph-based model for bioinformatics tasks.

Version published to 10.1101/2025.07.17.665451 on bioRxiv
Jul 22, 2025

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

This article has 2 authors:
1. Xiuwei Zhang
2. Yuqi Cheng
This article has no evaluationsLatest version Dec 10, 2025
Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome

This article has 7 authors:
1. Valentina Carbonari
2. Annamaria Defilippo
3. Ugo Lomoio
4. Caterina Francesca Perri
5. Barbara Puccio
6. Pierangelo Veltri
7. Pietro Hiram Guzzi
This article has no evaluationsLatest version Dec 23, 2025
A Survey on Efficient Protein Language Models

This article has 8 authors:
1. Shouren Wang
2. Debargha Ganguly
3. Vinooth Kulkarni
4. Wang Yang
5. Zhuoran Qiao
6. Daniel Blankenberg
7. Vipin Chaudhary
8. Xiaotian Han
This article has no evaluationsLatest version Dec 24, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome

A Survey on Efficient Protein Language Models