Localized Reactivity on Proteins as Riemannian Manifolds: A Quantum-Inspired Geometric Model for Deterministic, Metal-Aware Reactive-Site Prediction
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
We present a unified framework for protein reactive-site prediction that couples a rigorous geometric, quantum-inspired model with a metal-aware, fully deterministic implementation . Proteins are treated as smooth Riemannian manifolds; each residue is equipped with a localized fiber, whose environment vector encodes geometric and physico-chemical features. Motivated by Density Functional Theory—which links global electron density to reactivity—we hypothesize that a sufficiently rich local density stencil captures the same information at the residue scale. On this foundation we build a single-file Python toolkit, reactive_site_predictor.py , which ranks candidate reactive residues in both metallo- and apo-proteins. The score combines (i) a geometry-based environment term, (ii) relativistic ZORA distance factors for metal centers, (iii) coordination-number and angle-deviation corrections, and (iv) an optional pocket-proximity flag. A fixed global random seed removes all stochastic elements, guaranteeing reproducibility . Benchmarks on three representative assemblies—Rubisco, GroEL, and SecA—achieve top-10 recall between 90 and 50 percent for documented functional residues. While promising, the current validation is limited to three static PDB structures; extension to dynamic ensembles and explicit quantum layers is left to future work. The framework thus offers an interpretable, physics-grounded, and deterministic paradigm for protein-function prediction. Beyond residue-level reactivity, the same manifold–fiber formalism is ultimately intended for purely geometric analysis of protein–protein interfaces : instead of assigning scalar scores, we construct an explicit contact sub-manifold and study its curvature, topology, and fibre-overlap signatures to deduce interaction logic without additional learning. Benchmarks on sixteen diverse protein assemblies (three originals + thirteen new targets) show top-10 recall between 50%and 100 %, underscoring the framework’s generality. Across the full benchmark of sixteen proteins, the method recovers 134 literature-supported residues among 160 top-rank predictions, corresponding to a global Top-10 precision of 0.84.