Functional Group Composition: The Blueprint for Protein Interactions

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Understanding the complex landscape of protein interactions, especially those involving intrinsically disordered proteins (IDPs), is fundamental yet challenging due to their structural heterogeneity and flexibility. Traditional sequence-based homology methods frequently fall short in characterizing IDP functions and interactions. Here, we present a novel approach leveraging supervised and unsupervised machine learning techniques, focusing exclusively on the compositional features of proteins. An Edmond-Ogston-inspired mixing model can reliably predict the degree of survivin (BIRC5) binding as a function of peptide composition alone, revealing a first interesting connection with the composition diagrams of chemical thermodynamics. By representing protein sequences through their functional group compositions, we demonstrate that specific compositions robustly predict binding interactions with survivin, an important human protein in cellular regulation pathways. Experimental validation via peptide microarray confirms the predictive power of our simplified compositional model, independent of exact amino acid sequences. Extending this method across the human proteome, we identified distinct compositional signatures correlating with survivin interactions and revealed fine grained biologically meaningful functional clusters based on compositional similarity. Our findings suggest a compositional blueprint underpinning protein interactions, offering a powerful, simplified framework to decode complex biological networks.

Article activity feed