EVODEX: A Mechanistic Framework for Extracting, Structuring, and Predicting Enzymatic Reactivity
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Accurately modeling enzymatic reactivity is essential for synthesis planning, mass spectrometry interpretation, and biochemical knowledge extraction, but existing reaction datasets are often incomplete or inconsistently annotated. EVODEX introduces two abstractions to address this: partial reactions, which isolate atom-mapped substrate-product transformations from complex biochemical reactions, and Electronic Reaction Operators (EROs), which encode the bonding and orbital environment of atoms undergoing change. From 349,458 curated reactions in EnzymeMap, EVODEX constructs over 186,000 partial reactions and mines a core set of 1,404 EROs, each supported by at least 10 examples. Together, these explain 91 percent of test reactions at the formula level and 62 percent at the mechanistic level. A further filtered synthesis subset of 436 one-to-one operators that exclude ubiquitous metabolites provides a compact and generalizable vocabulary of single-step enzymatic transformations. EVODEX operators are auditable, reusable, and available as part of an open-source Python package with datasets, Jupyter notebooks, and a browsable website.