Human protein interactome structure prediction at scale with Boltz-2

Abstract

In humans, protein-protein interactions mediate numerous biological processes and are central to both normal physiology and disease. Extensive research efforts have aimed to elucidate the human protein interactome, and comprehensive databases now catalog these interactions at scale. However, structural coverage of the human protein interactome is limited and remains challenging to resolve through experimental methodology alone. Recent advances in artificial intelligence/machine learning (AI/ML)-based approaches for protein interaction structure prediction present opportunities for large-scale structural characterization of the human interactome. One such model, Boltz-2, which is capable of predicting the structures of protein complexes, may serve this objective. Here, we present de novo computed models of 1,394 binary human protein interaction structures predicted using Boltz-2 based on biochemically determined interaction data sourced from the IntAct database. We assessed the predicted interaction structures through different confidence metrics, which consider both overall structure and the interaction interface. These analyses indicated that prediction confidence tended to be greater for smaller complexes, while increased multiple sequence alignment (MSA) depth tended to improve prediction confidence. Additionally, we examined annotated protein domains and found that 679 of the predicted structural complexes contained a variety of domains with putative interaction involvement on the basis of interaction interface proximity. Furthermore, our analyses revealed intricate interaction networks within the context of biological function and cancer. This work demonstrates the utility of Boltz-2 for in silico structural modeling of the human protein interactome, highlighting both strengths and limitations, while also providing a novel view of broad functional contextualization. Ultimately, such modeling is expected to yield broad structural insights with relevance across multiple domains of biomedical research.
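
The abstract does not specify how domain involvement was inferred from interface proximity. As a rough illustration only, the idea can be sketched as a distance test between the two chains of a predicted complex, followed by an overlap check against annotated domain ranges. Everything in the sketch is an assumption made for illustration: the function names, the use of one representative atom per residue, the 8 Å cutoff, and the toy coordinates and domain ranges are hypothetical and are not taken from the study.

```python
import math


def interface_residues(chain_a, chain_b, cutoff=8.0):
    """Return residue numbers of each chain lying within `cutoff` angstroms of the other.

    chain_a / chain_b map residue number -> (x, y, z) of one representative atom.
    The 8.0 default is an illustrative assumption, not a value from the abstract.
    """
    near_a, near_b = set(), set()
    for res_a, xyz_a in chain_a.items():
        for res_b, xyz_b in chain_b.items():
            if math.dist(xyz_a, xyz_b) <= cutoff:
                near_a.add(res_a)
                near_b.add(res_b)
    return near_a, near_b


def domains_at_interface(domains, interface):
    """Return sorted names of domains that contain at least one interface residue.

    domains maps name -> (start, end), with both residue numbers inclusive.
    """
    return sorted(
        name
        for name, (start, end) in domains.items()
        if any(start <= res <= end for res in interface)
    )


# Toy data (hypothetical): two short chains and two annotated domains on chain A.
chain_a = {10: (0.0, 0.0, 0.0), 11: (3.8, 0.0, 0.0), 50: (40.0, 0.0, 0.0)}
chain_b = {5: (0.0, 6.0, 0.0), 90: (60.0, 60.0, 0.0)}
domains_a = {"SH3": (1, 20), "kinase": (40, 60)}

near_a, near_b = interface_residues(chain_a, chain_b)
print(domains_at_interface(domains_a, near_a))
```

A real analysis would read coordinates from the predicted structure files and domain ranges from an annotation resource, and the pairwise loop would likely be replaced by a spatial index for large complexes.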
