Under which circumstances do genomic neural networks learn motifs and their interactions?
Abstract
The use of neural networks to model genomic data in sequence-to-function scenarios has soared over the last decade. There remains much debate about whether these models are interpretable, either inherently or through downstream interpretability and explainability (xAI) techniques. Conclusions are further complicated by the steady publication of novel models, each with its own architecture, evaluations, and xAI experimental designs. Here, we posit that many of these complications arise from the lack of an explicitly specified generative model, baseline comparators, and thorough evaluation. Consequently, we attempt to reconcile concerns about interpretability under a motif-based generative model by simulating over 1000 motif-based genetic architectures at scale and evaluating the ability of different model architectures to predict an outcome given a sequence as input. We first show that a single convolutional layer is sufficient to discover motifs in a sequence-to-function model because of the way it shares the gradient locally among nucleotides. We then build on this by showing that, across genetic and network architectures (including attention, LSTMs, and stacked convolutions), most models are capable of modeling motifs and their interactions, with certain models outperforming others across genetic contexts and sample sizes. Distinguishing between shallow-level interpretations of motifs and deeper, gradient-based interpretations, we show that these approaches discover separate but overlapping sets of motifs, depending on motif characteristics. Finally, we validate our findings on an experimental dataset and conclude that, while attention is accurate, there are genetic contexts in which other neural networks complement findings from attention-based models and produce higher correlations between predictive performance and interpretability. This work suggests that when a generative model is correctly specified, most models are interpretable to an extent, whether or not their architectures are inherently so. Moreover, our work highlights opportunities for methods development in motif discovery and implies that employing a mixture of model architectures may be best for biological discovery.
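To make the abstract's central ideas concrete, the sketch below illustrates (in PyTorch) what a single-convolution sequence-to-function model and a gradient-based attribution pass might look like. This is not the authors' code: the class name `SingleConvModel`, the filter count, motif width, sequence length, and the plain saliency attribution are all illustrative assumptions, shown only to clarify how one convolutional layer can act as a bank of motif detectors and how input gradients distribute relevance across nucleotides.

```python
# Illustrative sketch (not the paper's implementation): a minimal single-convolution
# sequence-to-function model plus a gradient-based attribution pass, assuming
# one-hot encoded DNA input of shape (batch, 4, sequence_length).
import torch
import torch.nn as nn

class SingleConvModel(nn.Module):
    """One convolutional layer whose filters can act as motif detectors,
    followed by global max pooling and a linear readout to a scalar output."""
    def __init__(self, n_filters: int = 32, motif_width: int = 12):
        super().__init__()
        self.conv = nn.Conv1d(in_channels=4, out_channels=n_filters, kernel_size=motif_width)
        self.act = nn.ReLU()
        self.readout = nn.Linear(n_filters, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.act(self.conv(x))        # (batch, n_filters, positions)
        pooled = h.max(dim=-1).values     # global max pool over positions
        return self.readout(pooled).squeeze(-1)

def saliency(model: nn.Module, x: torch.Tensor) -> torch.Tensor:
    """Gradient of the prediction with respect to the one-hot input; the
    per-nucleotide gradients highlight positions the model relies on."""
    x = x.clone().requires_grad_(True)
    model(x).sum().backward()
    return x.grad.detach()

if __name__ == "__main__":
    model = SingleConvModel()
    # Random one-hot sequences standing in for real genomic data.
    idx = torch.randint(0, 4, (8, 200))
    x = torch.nn.functional.one_hot(idx, num_classes=4).float().permute(0, 2, 1)
    grads = saliency(model, x)
    print(grads.shape)  # torch.Size([8, 4, 200])
```

In this framing, inspecting the learned convolutional filters would correspond to the abstract's "shallow-level" motif interpretations, while the `saliency` output corresponds to the deeper, gradient-based interpretations; the paper's actual attribution methods and architectures may differ.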