Predicting Mutation-Induced Relative Protein-Ligand Binding Affinity Changes via Conformational Sampling and Diversity Integration with Subsampled Alphafold2 in Few-Shot Learning
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Predicting the impact of mutations on protein-ligand binding affinity is crucial in drug discovery, particularly in addressing drug resistance and repurposing existing drugs. Current structure-based methods are constrained by their dependence on known protein-ligand complex structures. The absence of structural data for mutated proteins, along with the potential of mutations to modify protein conformational states, brings challenges for reliable predictions. The downstream application of conformational sampling of Alphafold, such as the prediction of binding free energy changes induced by mutation, has not been fully explored. To tackle the challenge, we propose a few-shot learning approach that integrates AlphaFold2 subsampling, ensemble docking, and Siamese learning to predict the relative binding affinity changes induced by mutations. We validate our approach for predicting binding affinity changes in 31 variants of the mutated Abelson tyrosine kinase (ABL). The conformation ensembles for each ABL variant are generated using AlphaFold2 subsampling. The most likely conformational states are selected by Gaussian fitting to construct the pairwise structural inputs and compute relative binding energies. Then the relative binding affinities are derived by averaging the values predicted by the Siamese learning model across the constructed conformational pairs. By benchmarking on the Tyrosine kinase inhibitors (TKI) dataset and the direct binding affinity using the refined set of PDBbind database, our proposed approach achieves an increased Spearman correlation coefficient for five out of six TKI molecules across 31 mutants when estimating relative binding affinities. Although the proposed method is validated on the ABL, it can be transferred and applied to other drug-target interaction predictions when the conformational flexibility of proteins needs to be considered. We anticipate the approach will be valuable for predicting binding energy changes induced by mutations in proteins or ligands, facilitating drug discovery efforts.