AlphaFold 3 accurately models natural variants of Helicobacter pylori catalase KatA
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Subtle changes in protein sequence can equate to large changes in function, such as enabling pathogens to evade the immune system, hindering antibody recognition of antigens, or conferring antibiotic resistance. Even single amino acid substitutions may alter ligand binding affinity, enzymatic activity, and protein stability. Yet, due to limitations in time and resources, proteins closely related in sequence to those already characterized often remain unexamined. AlphaFold has emerged as a promising tool for protein structure prediction, though its utility in modeling single amino acid substitutions remains uncertain. In this study, we assessed AlphaFold 3’s accuracy in modeling natural variants of the Helicobacter pylori catalase KatA by comparing its predictions to a novel high-resolution crystal structure of KatA from strain SS1. This variant contains key substitutions at residues 234, 237, 255, and 421 relative to the well-characterized strain 26695. AlphaFold 3 models accurately reproduced the global structure and local conformations of most variant residues, with high fidelity in conservative substitutions but variable accuracy in more flexible or interface-exposed sites. We further explored how user inputs, such as incorrect oligomeric states or sequence modifications, influence prediction quality. While AlphaFold 3 consistently produced high-quality models, deviations at variant sites occurred when incorrect oligomeric states were specified. Our findings highlight both the strengths and limitations of AlphaFold 3 in modeling natural protein variants and underscore the importance of accurate user input for reliable structural predictions.
Importance
Experimental structure determination is rarely performed for natural protein variants possessing only minor amino acid differences from published structures, even though small substitutions can significantly impact structure and function. Here, we present a case study showing that AlphaFold 3 can accurately model the structures of natural protein variants. However, providing an incorrect oligomeric state can reduce model accuracy—an error that non-expert users may easily make.