NAStructuralDB : Structural database to facilitate computational studies of molecular modeling and recognition of proteins with special focus on antibody-antigen interactions

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Studying the interactions between antibodies and antigens is fundamental to the development of novel therapeutic biologics. Predictions of such interactions start with data collection. Though there exist reliable resources to identify antibody structures in the Protein Data Bank (PDB), such data still requires substantial processing to be usable in predictive tasks. Redundancy in sequences needs to be removed to avoid data leakages between train, test and validation sets. Descriptors such as surface accessibility, secondary structure and antibody region information need to be additionally annotated. Information on inter- and intra-molecular contacts, which is crucial to studying paratope/epitope information, needs to be collected. The specialized immunoglobulin format of Nanobodies® requires a separate dataset mirroring that of antibodies, given that their structure contains only a single VHH chain. Because antibody-antigen structures account for a small amount of all protein-protein contacts, having a molecular contact reference from other proteins is also desired. To address these issues, we introduce NAStructuralDB ( https://naturalantibody.com/na-structural/ ), a dataset of processed structures of antibodies, Nanobodies®, proteins and their complexes with molecular contact information and associated annotations. We use the opportunity of having collected the contact data to provide a reference of binding propensities of different residues across distinct contact types. We anticipate that this dataset will accelerate a broad range of predictive tasks by standardizing common, time-consuming data preparation steps in antibody and protein design.

Article activity feed