EC-Bench: A Benchmark for Enzyme Commission Number Prediction

Saeedeh Davoudi
Christopher S. Henry
Christopher S. Miller
Farnoush Banaei-Kashani

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Enzymes are proteins that catalyze specific biochemical reactions in cells. Enzyme Commission (EC) numbers are used to annotate enzymes in a four-level hierarchy that classifies enzymes based on the specific chemical reactions they catalyze. Accurate EC number prediction is essential for understanding enzyme functions. Despite the availability of numerous methods for predicting EC numbers from protein sequences, there is no unified framework for evaluating and studying such methods systematically. This gap limits the ability of the community to identify the most effective approaches for enzyme annotation. We introduce EC-Bench, a benchmark for EC number prediction, consisting of 1) an initial representative set of existing methods (including homology-based, deep learning, contrastive learning, and language model methods), 2) existing and novel accuracy and efficiency performance metrics, and 3) selected datasets to allow for comprehensive comparative study. EC-Bench is open-source and provides a framework for researchers to not only compare among existing methods objectively under uniform conditions, but also to introduce and effectively evaluate performance of new methods in a comparative frame-work. To demonstrate the utility of EC-Bench, we perform extensive experimentation to compare the existing EC number prediction methods and establish their advantages and disadvantages in a variety of prediction tasks, namely “exact EC number prediction”, “EC number completion” and (partial or additional) “EC number recommendation”. We find wide variation in the performance of different methods, but also subtle but potentially useful differences in the performance of different methods across tasks and for different parts of the EC hierarchy.

Version published to 10.1101/2025.06.25.661207v1 on bioRxiv
Jun 28, 2025

RC-GNN: A predictive model of enzyme-reaction pairs

This article has 4 authors:
1. Stefan C. Pate
2. Eric H. Wang
3. Linda J. Broadbelt
4. Keith E.J. Tyo
This article has no evaluationsLatest version Jun 27, 2025
Squidly: Enzyme Catalytic Residue Prediction Harnessing a Biology-Informed Contrastive Learning Framework

This article has 4 authors:
1. William JF Rieger
2. Mikael Boden
3. Frances Arnold
4. Ariane Mora
This article has no evaluationsLatest version Jun 20, 2025
SAKPE: A Site Attention Kinetic Parameters Prediction Method for Enzyme Engineering

This article has 8 authors:
1. Jia-He Qiu
2. Zongying Lin
3. Ke-Wei Chen
4. Tian-Yu Sun
5. Xian Zhang
6. Li Yuan
7. Yonghong Tian
8. Yun-Dong Wu
This article has no evaluationsLatest version May 6, 2025

Listed in

Abstract

Article activity feed

Related articles

RC-GNN: A predictive model of enzyme-reaction pairs

Squidly: Enzyme Catalytic Residue Prediction Harnessing a Biology-Informed Contrastive Learning Framework

SAKPE: A Site Attention Kinetic Parameters Prediction Method for Enzyme Engineering