AstraBIND: Graph Attention Network for Predicting Ligand Binding Sites

Aniruddh Goteti
Alexandra Vasilyeva
Çağlar Bozkurt

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Predicting ligand binding sites is central to computational biology and drug discovery. Existing machine learning approaches either use protein sequence, structure, or both. While structure-based deep learning models typically outperform sequence-based methods, they often require high computational cost or ligand-specific data, forcing a trade-off between accuracy and scalability.

We present AstraBIND, a lightweight graph neural network that bridges this gap by integrating protein sequence, structure (experimental or predicted), and homology information to predict ligand classes and binding residues within minutes. The model employs a GATv2 architecture with 0.9 M parameters, trained on ¿250 000 curated protein–ligand complexes across 16 ligand categories. By encoding residue-level features and spatial geometry through graph attention, AstraBIND identifies binding residues and ligand types while maintaining structural consistency.

In benchmarking, AstraBIND achieved a weighted macro-F1 of 0.47 across all ligand classes, with top performance for nucleotides (F1 = 0.79), porphyrins (0.74), and cofactors (0.73). Case studies, including p53 and CRFR1, demonstrate robust pocket localization for diverse proteins. Combined with its minimal inference time and broad ligand coverage, AstraBIND enables rapid in-silico screening and integration into laboratory workflows. Together with other Astra ML models (1; 2), it represents a step toward real-time protein design and validation pipelines.

Astra models are available at https://www.orbion.life .

Version published to 10.1101/2025.11.10.687555 on bioRxiv
Nov 11, 2025

Enhancing molecular property prediction via transformer with dual graph representation

This article has 2 authors:
1. Shuyuan Zhang
2. Alexei Lapkin
This article has no evaluationsLatest version Dec 9, 2025
Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

This article has 4 authors:
1. Tayyip Topuz
2. Zeki Erdem
3. Halil Bisgin
4. E. Demet Akten
This article has no evaluationsLatest version Feb 2, 2026
Nuclear-Charge-Guided Mamba with KAN Dynamic Mixture for Molecular Property Prediction

This article has 1 author:
1. Hong Wang
This article has no evaluationsLatest version Dec 30, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Enhancing molecular property prediction via transformer with dual graph representation

Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

Nuclear-Charge-Guided Mamba with KAN Dynamic Mixture for Molecular Property Prediction