DeorphaNN: Virtual screening of GPCR peptide agonists using AlphaFold-predicted active state complexes and deep learning embeddings
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
G protein-coupled receptors (GPCRs) are important cell surface receptors involved in numerous physiological processes. Although peptides are the cognate ligands for many of these receptors, identifying endogenous peptide agonists for GPCRs remains a significant challenge. Deep learning-based protein structure prediction algorithms, such as AlphaFold (AF) have utility in non-structural tasks including protein-protein interaction prediction, suggesting they may be useful for predicting GPCR-peptide agonist interactions. Leveraging a dataset of experimentally validated agonist and non-agonist GPCR-peptide interactions from Caenorhabditis elegans , we show that AF-Multimer confidence metrics enable partial discrimination between GPCR-agonist and non-agonist complexes. To better reflect agonist-bound conformations, AF-Multistate templates are used to produce active-state GPCR-peptide complexes, improving discriminatory power. Embeddings from the final hidden layer of AF-Multimer’s neural network, which capture structural and interaction patterns, were used to train random forest classifiers to assess whether AF-Multimer protein representations can distinguish agonist from non-agonist complexes. Feature performance analysis reveals that AF-Multimer’s pair representations outperform single representations, with distinct subregions of the pair representation providing complementary predictive signals. Building on these findings, we developed DeorphaNN—a graph neural network that integrates active-state GPCR-peptide structural predictions, interatomic interactions, and pair representations to predict agonist identity. DeorphaNN’s predictive utility generalizes to datasets outside of C. elegans , including annelids and humans, and experimental validation of predicted agonists for two orphan GPCRs uncovers their cognate agonists. Our approach offers a resource to accelerate GPCR deorphanization through the in silico identification of receptor-agonist candidates for AI-guided experimental validation.
SIGNIFICANCE STATEMENT
This study addresses the challenge of identifying peptide agonists for G protein-coupled receptors (GPCRs), which play crucial roles in numerous physiological processes. We have developed a machine learning model that combines predicted active state complexes, interatomic interactions, and deep learning protein representations obtained from AlphaFold-Multimer to identify potential peptide agonists for GPCRs. We then validated this approach by identifying and experimentally confirming novel ligands for two GPCRs. This approach can accelerate the identification of promising peptide ligands for GPCRs, offering a valuable tool to guide GPCR deorphanization efforts and improve our understanding of critical signalling pathways involved in health and disease.