Fine-Tuning Transformers Efficiently: A Survey on LoRA and Its Impact
Abstract
The rapid growth of Large Language Models (LLMs) has revolutionized natural language processing (NLP), enabling remarkable advances in text generation, machine translation, and various downstream applications. However, fine-tuning these models remains computationally expensive due to their vast number of parameters. Low-Rank Adaptation (LoRA) has emerged as a leading parameter-efficient fine-tuning (PEFT) technique that significantly reduces memory and computational costs while maintaining competitive performance. LoRA achieves this by freezing the pre-trained model weights and injecting trainable low-rank matrices into transformer layers, so that only a small fraction of the parameters are updated when adapting to new tasks. This survey provides a comprehensive review of LoRA, covering its theoretical foundations, practical implementation, recent advancements, and real-world applications. We explore hybrid approaches that combine LoRA with other fine-tuning techniques, such as prompt tuning and adapter layers, as well as extensions such as dynamic rank selection and quantized LoRA for further efficiency gains. Additionally, we discuss applications of LoRA beyond traditional NLP tasks, including vision-language models, speech processing, and reinforcement learning. Despite its advantages, LoRA presents open challenges, such as inference overhead and the selection of an optimal rank, which remain active areas of research. We highlight ongoing efforts to address these limitations and discuss future directions, including automated LoRA optimization, continual learning, and deployment in ultra-large foundation models. As AI models continue to grow in complexity, LoRA stands out as a scalable and cost-effective solution for fine-tuning, making it an essential tool for researchers and practitioners seeking to adapt LLMs efficiently.
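To make the core mechanism concrete, the sketch below illustrates the low-rank update the abstract describes: a frozen pre-trained weight W is augmented with a trainable rank-r product BA, so only the adapter factors receive gradients. This is a minimal illustration assuming a PyTorch setting; the LoRALinear class and its hyperparameters (r, alpha) are illustrative choices, not code from the surveyed work.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA sketch: y = W x + (alpha / r) * B A x, with W frozen."""
    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: float = 16.0):
        super().__init__()
        # Frozen pre-trained weight (randomly initialized here for illustration;
        # in practice this would be loaded from the pre-trained checkpoint).
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad = False
        # Trainable low-rank factors. B starts at zero so the adapter
        # initially contributes nothing (B A = 0) and training starts
        # from the pre-trained model's behavior.
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Only the r * (in_features + out_features) adapter parameters
        # receive gradients; the base projection stays fixed.
        return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

# Usage: adapt a 768-dimensional projection with a rank-8 update.
layer = LoRALinear(768, 768, r=8)
out = layer(torch.randn(4, 768))
```

Because the update BA has the same shape as W, it can be added into the frozen weight after training, which is how merged LoRA adapters avoid extra cost at inference time.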