Advances in protein function prediction from the fifth CAFA challenge

M. Clara De Paolis Kaluza
Rashika Ramola
Parnal Joshi
Damiano Piovesan
Walter Reade
Sandra Orchard
Maria J Martin
Alex Ignatchenko
Burkhard Rost
Christine A Orengo
Marc Robinson-Rechavi
Dannie Durand
Steven E Brenner
Casey S Greene
Sean D Mooney
Iddo Friedberg
Predrag Radivojac

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The Critical Assessment of Function Annotation (CAFA) is a long-standing community effort to independently assess computational methods for protein function prediction, to highlight well-performing methodologies, to identify bottlenecks in the field, and to provide a forum for the dissemination of results and exchange of ideas. In its fifth round (CAFA 5) of triennial challenges, a partnership with Kaggle Inc. facilitated participation from a large community of data scientists and computational biologists through a competitive prospective challenge on the crowdsourcing platform. In this work, we present an in-depth analysis of the submitted predictions and report improvements in accuracy over all methods from the previous CAFA challenges. We further introduce a new evaluation setting for proteins with pre-existing (incomplete) annotations and identify the need for methods that better leverage existing annotations to predict those that will be discovered later. Finally, we characterize the prospective evaluation framework by examining performance on a strict set of unpublished annotations and across intermediate database releases. Our results indicate that recent developments in the field, such as the availability of protein language models and accurately predicted 3D structures, as well as the growth of experimental annotations through biocuration, have all contributed to performance improvements.

Version published to 10.64898/2026.04.27.716980 on bioRxiv
Apr 30, 2026

On the state of protein function prediction: a report on the fourth CAFA challenge

This article has 154 authors:
1. Rashika Ramola
2. M. Clara De Paolis Kaluza
3. Damiano Piovesan
4. Yisu Peng
5. Parnal Joshi
6. Mahta Mehdiabadi
7. Federica Quaglia
8. Rita Pancsa
9. Lucía B. Chemes
10. Meisam Ahmadi
11. Hongryul Ahn
12. Adrian M. Altenhoff
13. Ehsaneddin Asgari
14. Maria Cristina Aspromonte
15. Volkan Atalay
16. Giulia Babbi
17. Davide Baldazzi
18. Meet M. Barot
19. Asa Ben-Hur
20. Alfredo Benso
21. Daniel Berenberg
22. Jari Björne
23. Florian Boecker
24. Paolo Boldi
25. Joseph Bonello
26. Nicola Bordin
27. Piyush Borole
28. Ali Ebrahimpour Boroojeny
29. Renzhi Cao
30. Stefano Di Carlo
31. Rita Casadio
32. Elena Casiraghi
33. Jia-Ming Chang
34. Chen Chen
35. Tse-Ming Chen
36. Jianlin Cheng
37. Ssu Chiu
38. Alperen Dalkıran
39. Radoslav S. Davidović
40. Christophe Dessimoz
41. Rucheng Diao
42. Warith Eddine Djeddi
43. Tunca Dogan
44. Sean T. Flannery
45. Paolo Fontana
46. Marco Frasca
47. Lydia Freddolino
48. Branislava Gemović
49. Jesse Gillis
50. Filip Ginter
51. Vladimir Gligorijevic
52. Giuliano Grossi
53. Michael Heinzinger
54. Kyle Hippe
55. Robert Hoehndorf
56. Liisa Holm
57. Jie Hou
58. John R. Hover
59. Yen-Ting Huang
60. Emilio Ispano
61. Suraiya Jabin
62. Aashish Jain
63. David T. Jones
64. Suwisa Kaewphan
65. Yuki Kagaya
66. Jenna Kanerva
67. Daisuke Kihara
68. Maxat Kulmanov
69. Sunil Kumar
70. Lukasz Kurgan
71. Enrico Lavezzo
72. Jon Lees
73. Wen-Hung Liao
74. Han Lin
75. Michal Linial
76. Maria Littmann
77. Lizhi Liu
78. Tong Liu
79. Yi Wei Liu
80. Stavros Makrodimitris
81. Laura Manuto
82. Pier Luigi Martelli
83. Alice Carolyn Mchardy
84. Gabriela A. Merino
85. Diego H. Milone
86. Sarthak Mishra
87. Mohammad R. K. Mofrad
88. David Moi
89. Tsukasa Nakamura
90. Vijay Kumar Narsapuram
91. Maria Victoria Nugnes
92. Takeshi Obayashi
93. Dan Ofer
94. Alberto Paccanaro
95. Vladimir R. Perovic
96. Alessandro Petrini
97. Gianfranco Politano
98. Daniele Raimondi
99. Nadav Rappoport
100. Hafeez Ur Rehman
101. Maarten J. M. F. Reijnders
102. Marcel J. T. Reinders
103. P. Douglas Renfrew
104. Ahmet S. Rifaioglu
105. Alfonso E. Romero
106. Abhiman Saraswathi
107. Castrense Savojardo
108. Harry M. Scholes
109. Heiko Schoof
110. Yang Shen
111. Ian Sillitoe
112. Georgina Stegmayer
113. Amos Stern
114. Henri Tiittanen
115. Sumyyah Toonsi
116. Stefano Toppo
117. Petri Toronen
118. Mateo Torres
119. Gabriella Trucco
120. Giorgio Valentini
121. Nevena Veljkovic
122. Alex Warwick Vesztrocy
123. Vedrana Vidulin
124. Amelia Villegas-Morcillo
125. Antti Virtanen
126. Wim Vranken
127. Slobodan Vucetic
128. Cen Wan
129. Zheng Wang
130. Mark N. Wass
131. Robert M. Waterhouse
132. Sadok Ben Yahia
133. Haixuan Yang
134. Shuwei Yao
135. Ronghui You
136. Jeffrey Yunes
137. Chengxin Zhang
138. Yang Zhang
139. Chenguang Zhao
140. Xiaogen Zhou
141. Yi-Heng Zhu
142. Shanfeng Zhu
143. Hao Zhu
144. Gökhan Özsari
145. Burkhard Rost
146. Christine Orengo
147. Marc Robinson-Rechavi
148. Dannie Durand
149. Steven E. Brenner
150. Casey S. Greene
151. Sean D. Mooney
152. Silvio C. E. Tosatto
153. Iddo Friedberg
154. Predrag Radivojac
This article has no evaluationsLatest version May 11, 2026
DIOPT: the DRSC Integrative Ortholog Prediction Tool, 2026 update

This article has 7 authors:
1. Yanhui Hu
2. Aram Comjean
3. Chenxi Gao
4. Austin Veal
5. Shinya Yamamoto
6. Stephanie E. Mohr
7. Norbert Perrimon
This article has no evaluationsLatest version Apr 16, 2026
Discriminative Site-Directed Protein Engineering via Lightweight CASPE Platform

This article has 10 authors:
1. Qiufeng Deng
2. Jie Qiao
3. Chuan Wang
4. Xinyue Ni
5. Yongyao Chang
6. Nan Zhao
7. Rui Zhai
8. Haiyang Cui
9. Xiujuan Li
10. Mingjie Jin
This article has no evaluationsLatest version Apr 28, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

On the state of protein function prediction: a report on the fourth CAFA challenge

DIOPT: the DRSC Integrative Ortholog Prediction Tool, 2026 update

Discriminative Site-Directed Protein Engineering via Lightweight CASPE Platform