Deep Learning of High-throughput Transcription Factor–DNA Binding Affinity Data: Quantitative Comparison with Pairwise-Additive Models

Ke Shen
Zhi Wang
Xiaoliang Sunney Xie

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Transcription factors (TFs) regulate gene expression by binding to specific DNA sequences. Widely used models of TF–DNA binding, such as position weight matrices (PWMs) and position-specific affinity matrices (PSAMs), assume binding free energy is the sum of independent base contributions. However, there is ample evidence that non-additive effects significantly influence TF binding. Here, we utilize data from a high-throughput in vitro assay ( ivt FOODIE) to generate genome-scale TF–DNA dissociation constants ( K _d ) and systematically evaluate sequence-to-affinity models of increasing complexity. We demonstrate that pairwise additive models exhibit systematic deviations from the measured affinity landscapes. Models incorporating adjacent dinucleotide interactions and deep learning architectures achieve markedly improved agreement with experimental K _d values. The magnitude of this non-pairwise-additivity depends strongly on the TF family. In silico mutation screening reveals widespread, TF-specific long-range interposition dependencies, highlighting the role of energetic coupling across distant positions in target recognition. These results provide a quantitative framework for comparing non-pairwise-additive energetic effects across diverse TFs.

Version published to 10.64898/2026.05.18.725888 on bioRxiv
May 19, 2026

Prediction of Transcription Factor DNA Binding Affinity with High-Throughput K _d Measurements and Deep Learning

This article has 22 authors:
1. Zhi Wang
2. Di Wang
3. Ke Shen
4. Junchen Luo
5. Xinyao Wang
6. Nan Wu
7. Yunzhi Lang
8. Xiangyu Wang
9. Jun Ren
10. Wenyang Dong
11. Lu Pan
12. Yitong Lyu
13. Gang Li
14. Dubai Li
15. Chen Xie
16. Zhen Zhang
17. Shijun Yu
18. Liuying Shan
19. Nannan Zhang
20. Jian Yan
21. Mingchen Chen
22. Xiaoliang Sunney Xie
This article has no evaluationsLatest version May 20, 2026
TFBindFormer: A Cross-Attention Transformer for Transcription Factor–DNA Binding Prediction

This article has 4 authors:
1. Ping Liu
2. Lyuwei Wang
3. Shreya Basnet
4. Jianlin Cheng
This article has no evaluationsLatest version Apr 11, 2026
Decoding TF-Specific Predictability in Cross-Species Binding Site Inference

This article has 4 authors:
1. Yiman Wang
2. Guifen Liu
3. Yucheng Wang
4. Yong Zhang
This article has no evaluationsLatest version Apr 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Prediction of Transcription Factor DNA Binding Affinity with High-Throughput K d Measurements and Deep Learning

TFBindFormer: A Cross-Attention Transformer for Transcription Factor–DNA Binding Prediction

Decoding TF-Specific Predictability in Cross-Species Binding Site Inference

Prediction of Transcription Factor DNA Binding Affinity with High-Throughput K _d Measurements and Deep Learning