MS-Adapter: Multi-scaled Adapter for Efficient DeepFake Detection

Abstract

Existing deepfake detection methods rely heavily on low-level forgery patterns, leading to poor performance when encountering unseen forgery types or low-quality images. Recently, Vision Transformers (ViTs) pretrained on large-scale datasets have demonstrated strong generalization capabilities across various downstream image tasks. However, parameter-efficient fine-tuning methods for ViTs have shown limited effectiveness in deepfake detection, mainly because ViTs rely on high-level semantics while struggling to capture fine-grained local details. To address this issue, this paper proposes MS-Adapter, a multi-scale adapter network for efficient deepfake detection. By embedding multi-scale adapter modules within a pretrained ViT, MS-Adapter progressively extracts and fuses features across multiple scales, from low-level forgery artifacts to high-level semantic forgery patterns. In addition, a Temporal Aggregation Transformer receives the frame-level features extracted by the multi-scale adapters and performs temporal modeling on them to further improve forgery detection. Experimental results demonstrate that MS-Adapter achieves superior performance on multiple datasets, including FF++, Celeb-DFv2, and DFDC, while requiring only a small number of trainable parameters.
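The abstract names two components but gives no implementation details, so the PyTorch sketch below is purely illustrative of the described pipeline, not the authors' design. The bottleneck-adapter structure, the kernel sizes (3/5/7), the bottleneck width, fusion by summation, and the temporal encoder's depth, head count, and mean pooling are all assumptions made for this sketch.

```python
import torch
import torch.nn as nn


class MultiScaleAdapter(nn.Module):
    """Illustrative bottleneck adapter: parallel depthwise convolutions at
    several kernel sizes capture local forgery artifacts, fused by summation.
    All hyperparameters here are assumptions, not the paper's specification."""

    def __init__(self, dim: int, bottleneck: int = 64, kernel_sizes=(3, 5, 7)):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)    # project to a small bottleneck
        self.branches = nn.ModuleList(
            nn.Conv2d(bottleneck, bottleneck, k, padding=k // 2, groups=bottleneck)
            for k in kernel_sizes                 # one depthwise conv per scale
        )
        self.up = nn.Linear(bottleneck, dim)      # project back to the ViT width
        self.act = nn.GELU()

    def forward(self, tokens, hw):
        # tokens: (B, N, dim) patch tokens from a frozen ViT block, N == H * W
        B, N, _ = tokens.shape
        H, W = hw
        x = self.act(self.down(tokens))
        x = x.transpose(1, 2).reshape(B, -1, H, W)      # restore spatial layout
        x = sum(branch(x) for branch in self.branches)  # fuse the scales
        x = x.flatten(2).transpose(1, 2)                # back to (B, N, bottleneck)
        return tokens + self.up(self.act(x))            # residual adapter update


class TemporalAggregation(nn.Module):
    """Illustrative temporal transformer over per-frame embeddings, mean-pooled
    into one video-level real/fake logit. Depth and head count are assumptions."""

    def __init__(self, dim: int, depth: int = 2, heads: int = 8):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, depth)
        self.head = nn.Linear(dim, 1)

    def forward(self, frame_feats):
        # frame_feats: (B, T, dim), one embedding per sampled video frame
        x = self.encoder(frame_feats)
        return self.head(x.mean(dim=1)).squeeze(-1)


# Shape check only; a real pipeline would feed tokens from a frozen pretrained ViT.
adapter = MultiScaleAdapter(dim=768)
temporal = TemporalAggregation(dim=768)
tokens = torch.randn(2, 14 * 14, 768)          # patch tokens for one frame
frames = torch.randn(2, 8, 768)                # 8 frame-level features per video
print(adapter(tokens, (14, 14)).shape)         # torch.Size([2, 196, 768])
print(temporal(frames).shape)                  # torch.Size([2])
```

In a setup like this, the pretrained ViT backbone stays frozen and only the adapter and temporal-encoder weights train, which is consistent with the abstract's claim of a small number of trainable parameters.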
