Evaluating the Effectiveness of Parameter-Efficient Fine-Tuning in Genomic Classification Tasks

Abstract

Foundation models are increasingly being leveraged for biological tasks. To address the high memory requirements of fine-tuning large pre-trained language models, parameter-efficient fine-tuning (PEFT) methods are increasingly being adopted. Previous studies have shown minimal, if any, loss in performance when using PEFT on binary classification tasks. However, the impact of PEFT on tasks with large classification spaces has not been systematically evaluated. In this work, we apply PEFT to the problem of taxonomic classification, using pre-trained genomic language models as the classification backbone. We explore several training strategies, including PEFT, full fine-tuning, and partial fine-tuning, for classifying sequences at the superkingdom, phylum, and genus levels. We find that PEFT-trained models significantly underperform those trained via full or partial fine-tuning. Additionally, we show that pre-trained models outperform randomly initialized ones.
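The abstract contrasts three training strategies without detailing their setup. The sketch below illustrates how each is commonly configured, assuming LoRA (via the Hugging Face peft library) as the PEFT method; the backbone name, label count, and hyperparameters are hypothetical stand-ins, since this page does not specify the genomic language model or PEFT variant used.

```python
"""Sketch of the three training strategies, under assumed choices:
LoRA as the PEFT method, an ESM-style genomic backbone, and an
illustrative genus-level label count."""
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, get_peft_model

# Hypothetical backbone and label space; the article does not name either here.
MODEL_NAME = "InstaDeepAI/nucleotide-transformer-500m-human-ref"
NUM_GENERA = 1000  # large classification space, e.g. genus-level labels


def load_backbone():
    """Load a fresh pre-trained backbone with a sequence-classification head."""
    return AutoModelForSequenceClassification.from_pretrained(
        MODEL_NAME, num_labels=NUM_GENERA
    )


def freeze_all_but_last_n(model, n=2):
    """Partial fine-tuning: train only the last n encoder layers and the head."""
    for param in model.parameters():
        param.requires_grad = False
    for param in model.classifier.parameters():  # attribute path is model-dependent
        param.requires_grad = True
    for layer in model.base_model.encoder.layer[-n:]:
        for param in layer.parameters():
            param.requires_grad = True


# Strategy 1: PEFT (LoRA) -- inject low-rank adapters, freeze the backbone.
lora_config = LoraConfig(
    task_type="SEQ_CLS",
    r=8,                                # adapter rank (assumed hyperparameter)
    lora_alpha=16,
    lora_dropout=0.1,
    target_modules=["query", "value"],  # attention projections; model-dependent
)
peft_model = get_peft_model(load_backbone(), lora_config)
peft_model.print_trainable_parameters()  # typically <1% of weights are trainable

# Strategy 2: full fine-tuning -- every parameter receives gradients (default).
full_model = load_backbone()

# Strategy 3: partial fine-tuning -- freeze all but the last two encoder layers.
partial_model = load_backbone()
freeze_all_but_last_n(partial_model, n=2)
```

Each of the three models can then be passed to the same training loop; only the set of trainable parameters differs between strategies.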
