Interpretable biophysical neural networks of transcriptional activation domains separate roles of protein abundance and coactivator binding
Abstract
Deep neural networks have improved the accuracy of many difficult prediction tasks in biology, but it remains challenging to interpret these networks and learn molecular mechanisms. Here, we address the interpretability challenges associated with predicting transcriptional activation domains from protein sequence. Activation domains, regions within transcription factors that drive gene expression, have traditionally been difficult to predict due to their sequence diversity and poor conservation. Multiple deep neural networks can now accurately predict activation domains, but these predictors are difficult to interpret. With the goal of interpretability, we designed simple neural networks that incorporate biophysical models of activation domains. The simplicity of these neural networks allowed us to visualize their parameters and directly interpret what the networks learned. The biophysical neural networks revealed two new ways that the arrangement of residues (i.e. the sequence grammar) in activation domains controls function: 1) hydrophobic residues both increase activation domain strength and decrease protein abundance, and 2) acidic residues control both activation domain strength and protein abundance. Notably, the biophysical neural networks helped us recognize the same signatures in complex interpretations of the deeper neural networks. We demonstrate how combining biophysical and deep neural networks maximizes both prediction accuracy and interpretability, yielding insights into biological mechanisms.