Disentangling Protein Function via Decoupled Information Theoretic Selection of Key Tuning Residues

Haris Saeed
Aidong Yang
Wei E. Huang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Rational protein engineering requires identifying residues that modulate function without disrupting functionality, a key challenge in protein engineering. Existing computational methods struggle to distinguish genuine functional sites from positions coevolving due to structural constraints, leading to high false-discovery rates. Here we present an information-theoretic decoupling framework that, without machine learning, isolates key tuning residues by computationally “denoising” sequence data, iteratively removing confounding evolutionary signals to reveal underlying functional sites. We validated this framework across 10 datasets spanning enzymes, fluorescent proteins, and antibodies. In a nanobody-antigen binding case study, our method identified > 25% (6/20) of verified binding-critical residues ( p = 0.031), while the best of five benchmarked tools found zero. Performance was consistent across all datasets, with supervised variants achieving large effect sizes (Hedges’ g > 0.7, p < 0.01) and unsupervised variants also showing gains ( g > 0.2, p < 0.05) over benchmarks. This interpretable framework provides a generalizable method to accelerate protein design, from focusing antibody maturation to optimizing biocatalysts.

Version published to 10.1101/2025.05.28.653817 on bioRxiv
May 28, 2025

Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

This article has 4 authors:
1. Tayyip Topuz
2. Zeki Erdem
3. Halil Bisgin
4. E. Demet Akten
This article has no evaluationsLatest version Feb 2, 2026
Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

This article has 5 authors:
1. Mujeebu Rehman
2. Qinghua Liu
3. Muhammad Javed
4. Ali Ghulam
5. Teerath Kumar
This article has no evaluationsLatest version Dec 11, 2025
Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome

This article has 7 authors:
1. Valentina Carbonari
2. Annamaria Defilippo
3. Ugo Lomoio
4. Caterina Francesca Perri
5. Barbara Puccio
6. Pierangelo Veltri
7. Pietro Hiram Guzzi
This article has no evaluationsLatest version Dec 23, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome