Self-Attention Factor-Tuning for Parameter Efficient Fine-Tuning
Abstract
Transformers have revolutionized Natural Language Processing and Computer Vision, largely because their key innovation, the attention mechanism, captures long-range dependencies. Despite the success of these models, their growing size has led to an ever-increasing demand for processing power, limiting their practical deployment. In recent years, tensor decomposition-based parameter-efficient fine-tuning techniques have emerged as a promising way around this computational bottleneck. In this work, we investigate a modified version of Factor-Tuning that reduces the inter-layer coupling introduced by the original Factor-Tuning and applies the factorized updates exclusively to the attention mechanism. We refer to this method as Self-Attention Factor-Tuning. To evaluate the effectiveness of our approach, we conduct image-classification experiments with Vision Transformers on all 19 datasets of the VTAB-1k benchmark. The results demonstrate that the proposed framework effectively reduces the number of parameters required to fine-tune a transformer, achieving new state-of-the-art performance on three of the 19 datasets in the benchmark and outperforming the original Factor-Tuning paradigm as well as various other competitive techniques, whilst using significantly fewer parameters.
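To make the idea concrete, the sketch below shows one way an attention-only, per-layer factorized update could be attached to a Vision Transformer: each frozen attention projection is augmented with a trainable low-rank factorized delta, while the rest of the backbone stays frozen. This is a minimal illustration under stated assumptions, not the paper's reference implementation; the class name FactorTunedLinear, the rank r, the scaling s, and the timm-style attribute names qkv and proj are all assumptions made for the example, and the original Factor-Tuning additionally shares factors across layers through a tensor decomposition, which this per-layer sketch deliberately omits.

```python
# Illustrative sketch of attention-only factorized fine-tuning (assumed names).
import torch
import torch.nn as nn
import torch.nn.functional as F


class FactorTunedLinear(nn.Module):
    """A frozen linear layer plus a trainable factorized update s * (U @ V)."""

    def __init__(self, base: nn.Linear, r: int = 8, s: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # keep the pre-trained weights frozen

        d_out, d_in = base.weight.shape
        # One factor starts at zero so fine-tuning begins from the pre-trained model.
        self.U = nn.Parameter(torch.zeros(d_out, r))
        self.V = nn.Parameter(torch.randn(r, d_in) * 0.02)
        self.s = s

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = base(x) + s * x @ (U V)^T
        return self.base(x) + self.s * F.linear(x, self.U @ self.V)


def wrap_attention_only(vit: nn.Module, r: int = 8) -> nn.Module:
    """Attach factorized updates only to the self-attention projections.

    Assumes timm-style ViT attention blocks exposing `qkv` and `proj` linears;
    other implementations may use different attribute names.
    """
    for module in list(vit.modules()):
        if hasattr(module, "qkv") and isinstance(module.qkv, nn.Linear):
            if isinstance(getattr(module, "proj", None), nn.Linear):
                module.proj = FactorTunedLinear(module.proj, r)
            module.qkv = FactorTunedLinear(module.qkv, r)
    return vit
```

As a usage note, after wrapping a pre-trained ViT this way, only the U and V factors (and typically the classification head) remain trainable, so the count of parameters with requires_grad set gives the fine-tuning budget that the abstract's parameter comparisons refer to.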