Benchmarking DNA Foundation Models for zero-shot variant effect prediction: the role of context, training, and architecture

Ilaria Alfisi
Francesca Ciapi
Marta Baragli
Alberto Magi

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In this study, we systematically evaluate the performance of several DNA foundation models (NT, DNABERT, and HyenaDNA) in predicting the functional impact of genetic variants using Zero-shot scoring, a method that does not require task-specific fine-tuning. We assess the models’ sensitivity to sequence alterations introduced by Single Nucleotide Variants (SNVs), comparing their ability to capture both local and extended contextual effects. Using pathogenic, benign, and uncertain SNVs from ClinVar, we show that large multi-species NT models outperform other architectures in detecting functional consequences, not only at the mutation site but also in adjacent regions. These models exhibit superior discriminative power across variant categories, especially when aggregating Zero-shot scores over multiple surrounding tokens. Conversely, models trained solely on human sequences, such as DNABERT and HyenaDNA, show limited contextual awareness and reduced ability to differentiate variant effects. Our findings highlight the critical importance of model size, training objective, and training data diversity in shaping model performance. Furthermore, we discuss current limitations in modeling long-range dependencies in genomic sequences and suggest that innovations in transformer architectures, such as sparse attention or memory-augmented models, may provide viable paths toward scalable, genome-wide variant effect prediction.

Version published to 10.1101/2025.06.15.659748v1 on bioRxiv
Jun 19, 2025

From Likelihood to Fitness: Improving Variant Effect Prediction in Protein and Genome Language Models

This article has 4 authors:
1. Charles W. J. Pugh
2. Paulina G. Nuñez-Valencia
3. Mafalda Dias
4. Jonathan Frazer
Reviewed by Arcadia Science

This article has 4 evaluationsAppears in 1 listLatest version May 24, 2025Latest activity Jun 6, 2025
VEFill: a model for accurate and generalizable deep mutational scanning score imputation across protein domains

This article has 3 authors:
1. Polina V Polunina
2. Wolfgang Maier
3. Alan F Rubin
This article has no evaluationsLatest version May 14, 2025
pLM-SAV: A Δ-Embedding Approach for Predicting Pathogenic Single Amino Acid Variants

This article has 5 authors:
1. Orsolya Gereben
2. Hedvig Tordai
3. Lana Khamisi
4. Alexandria Kouri
5. Tamás Hegedűs
This article has no evaluationsLatest version May 31, 2025

Listed in

Abstract

Article activity feed

Related articles

From Likelihood to Fitness: Improving Variant Effect Prediction in Protein and Genome Language Models

VEFill: a model for accurate and generalizable deep mutational scanning score imputation across protein domains

pLM-SAV: A Δ-Embedding Approach for Predicting Pathogenic Single Amino Acid Variants