TFBindFormer: A Cross-Attention Transformer for Transcription Factor–DNA Binding Prediction

Ping Liu
Lyuwei Wang
Shreya Basnet
Jianlin Cheng

Abstract

Transcription factors (TFs) are central regulators of gene expression, and their selective recognition of genomic DNA underlies various biological processes. Experimental profiling of TF–DNA interactions using chromatin immunoprecipitation followed by sequencing (ChIP-seq) provides high-resolution maps of in vivo TF–DNA binding but remains costly, labor-intensive, and inherently low-throughput, which limits its scalability across different transcription factors, cell types, and regulatory conditions. Computational modeling therefore plays an essential role in inferring TF–DNA interactions at genome scale. However, most existing computational models rely solely on DNA sequence and chromatin features to predict TF–DNA binding and neglect TF-specific protein information. This omission limits their ability to capture protein-dependent binding specificity. Here, we present TFBindFormer, a hybrid cross-attention transformer that explicitly integrates genomic DNA features with TF-specific representations derived from protein sequences and structures. By modeling protein-conditioned, position-specific TF–DNA interactions, TFBindFormer enables direct learning of the molecular determinants underlying DNA recognition. Evaluated across hundreds of cell-type–specific TFs and hundreds of millions of genome-wide DNA bins, TFBindFormer consistently outperforms DNA-only baselines, achieving substantial gains in both area under the precision-recall curve (AUPRC) and area under the receiver operating characteristic curve (AUROC). Together, these results demonstrate that integrating TF and DNA features via cross-attention enables TFBindFormer to serve as an effective and scalable framework for large-scale TF–DNA binding prediction.
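To make the cross-attention idea in the abstract concrete, the following is a minimal sketch in plain Python and is not the authors' implementation. It assumes that DNA positions act as queries and TF protein residues act as keys and values; the abstract does not state the attention direction, the embedding sizes, or the projection layers, so the function name `cross_attention`, the toy dimensions, and the random embeddings are all hypothetical. Learned query/key/value projections and multiple heads are omitted for brevity.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(dna_queries, tf_keys, tf_values):
    """Scaled dot-product cross-attention (single head, no learned projections).

    Assumption: each DNA position (query) attends over all TF residues
    (keys/values), giving a protein-conditioned vector per DNA position.
    """
    scale = 1.0 / math.sqrt(len(tf_keys[0]))
    outputs, weights = [], []
    for q in dna_queries:
        # Similarity of this DNA position to every TF residue.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in tf_keys]
        w = softmax(scores)
        # Weighted sum of TF residue value vectors.
        out = [
            sum(w[j] * tf_values[j][c] for j in range(len(tf_values)))
            for c in range(len(tf_values[0]))
        ]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


random.seed(0)
dim = 4  # hypothetical embedding size
# Toy stand-ins for learned embeddings: 6 DNA positions, 3 TF residues.
dna_bins = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(6)]
tf_residues = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(3)]

context, attn = cross_attention(dna_bins, tf_residues, tf_residues)
print(len(context), len(context[0]))  # one context vector per DNA position
```

In this sketch each row of `attn` is a probability distribution over TF residues for one DNA position, which is one way a model could represent position-specific, protein-conditioned interactions.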

Version published to 10.64898/2026.04.09.717563 on bioRxiv
Apr 11, 2026

Related articles

Decoding TF-Specific Predictability in Cross-Species Binding Site Inference
Yiman Wang, Guifen Liu, Yucheng Wang, Yong Zhang
No evaluations. Latest version Apr 16, 2026

Beyond motif recognition: Specificity of human transcription factors in yeast
Joshua Bugis, Dan Reuben Zlotnik-Weinberg, Wajd Manadre, Vladimir Mindel, Yunwei Lu, Juan Ignacio Fuxman Bass, Naama Barkai
No evaluations. Latest version Apr 29, 2026

Proteome-wide identification and modeling of interactions between transactivation domains and arginine-glycine-rich regions
Yukti Khanna, Arijana Hajdarević, Johanna Pirchner, Sinem Usluer, Anastasia Rakhimbekova, Iva Pritišanac, Sören von Bülow, Kresten Lindorff-Larsen, Tobias Madl
No evaluations. Latest version Apr 16, 2026