Localized Reactivity on Proteins as Riemannian Manifolds: A Quantum-Inspired Geometric Model for Deterministic, Metal-Aware Reactive-Site Prediction

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

We present a unified framework for protein reactive-site prediction that couples a rigorous geometric, quantum-inspired model with a metal-aware, fully deterministic implementation . Proteins are treated as smooth Riemannian manifolds; each residue is equipped with a localized fiber, whose environment vector encodes geometric and physico-chemical features. Motivated by Density Functional Theory—which links global electron density to reactivity—we hypothesize that a sufficiently rich local density stencil captures the same information at the residue scale. On this foundation we build a single-file Python toolkit, reactive_site_predictor.py , which ranks candidate reactive residues in both metallo- and apo-proteins. The score combines (i) a geometry-based environment term, (ii) relativistic ZORA distance factors for metal centers, (iii) coordination-number and angle-deviation corrections, and (iv) an optional pocket-proximity flag. A fixed global random seed removes all stochastic elements, guaranteeing reproducibility . Benchmarks on three representative assemblies—Rubisco, GroEL, and SecA—achieve top-10 recall between 90 and 50 percent for documented functional residues. While promising, the current validation is limited to three static PDB structures; extension to dynamic ensembles and explicit quantum layers is left to future work. The framework thus offers an interpretable, physics-grounded, and deterministic paradigm for protein-function prediction. Beyond residue-level reactivity, the same manifold–fiber formalism is ultimately intended for purely geometric analysis of protein–protein interfaces : instead of assigning scalar scores, we construct an explicit contact sub-manifold and study its curvature, topology, and fibre-overlap signatures to deduce interaction logic without additional learning. Benchmarks on sixteen diverse protein assemblies (three originals + thirteen new targets) show top-10 recall between 50%and 100 %, underscoring the framework’s generality. Across the full benchmark of sixteen proteins, the method recovers 134 literature-supported residues among 160 top-rank predictions, corresponding to a global Top-10 precision of 0.84.

Article activity feed