Genomic Foundationless Models: Pretraining Does Not Promise Performance

Abstract

The success of Large Language Models has inspired the development of Genomic Foundation Models (GFMs) through similar pretraining techniques. However, the relationship between pretraining performance and effectiveness in downstream genomic tasks remains unclear. Additionally, the high computational cost of pretraining raises questions about its cost-efficiency. To assess the usefulness of pretraining in genomics, we evaluated seven different GFMs across 52 diverse genomic tasks, comparing them to their counterparts with randomly initialized weights. Surprisingly, we found that randomly initialized models can match or even surpass the performance of pretrained GFMs in finetuning and feature extraction tasks. We also discovered that pretrained GFMs fail to capture clinically relevant genetic mutations, which are crucial for understanding genetic disorders and phenotypic traits. Our results indicate that most of the current pretrained GFMs lack a "foundational" understanding of genomics and provide minimal utility, even for basic tasks such as sequence classification. These findings collectively highlight the need to critically rethink pretraining approaches for genomics. Our code is available at github.com/m42-health/gfm-random-eval.
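The comparison described above hinges on pairing each pretrained GFM with an architecturally identical but randomly initialized counterpart. A minimal sketch of how such a pair might be constructed with the Hugging Face Transformers API is shown below; the exact evaluation code is in the linked repository, and the checkpoint name here is illustrative rather than necessarily one of the seven GFMs studied.

```python
# Sketch: building a pretrained model and a randomly initialized counterpart
# that shares the same architecture. Assumes a Hugging Face-hosted GFM checkpoint;
# the name below is an illustrative example, not the authors' exact setup.
from transformers import AutoConfig, AutoModel

checkpoint = "InstaDeepAI/nucleotide-transformer-500m-human-ref"  # illustrative GFM

# Counterpart 1: load the published pretrained weights.
pretrained_model = AutoModel.from_pretrained(checkpoint)

# Counterpart 2: same configuration (layers, hidden size, vocabulary),
# but weights drawn from the default random initialization, i.e. no pretraining.
config = AutoConfig.from_pretrained(checkpoint)
random_model = AutoModel.from_config(config)

# Both models can then be finetuned on the same downstream task, or used as
# frozen feature extractors, so that any performance gap isolates the
# contribution of pretraining itself.
```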
