Tempo-Spatial-Fusion Network: A Novel Framework for Deepfake Detection through Dynamic Integration of Spatial and Temporal Features
Abstract
The rapid development of deepfake technology has led to increasingly sophisticated AI-generated content that threatens the integrity of digital media. To address this challenge, we present the Tempo-Spatial-Fusion Network (TSF-Net), which systematically integrates spatial and temporal analysis for robust deepfake detection. Unlike previous approaches that focus exclusively on either spatial artifacts or temporal inconsistencies, TSF-Net introduces three key innovations. First, it employs a Cross-Modal Attention Fusion mechanism that dynamically integrates complementary features from EfficientNetV2L and XceptionNet. Second, it incorporates a Temporal Inconsistency Attention Module that explicitly targets frame-to-frame discontinuities. Third, it uses an Artifact-Aware Loss Function that directly penalizes predictions inconsistent with detected manipulation cues. Extensive experiments on the DFDC and FaceForensics++ datasets demonstrate TSF-Net's superior performance (95.36% accuracy, 0.92 F1-score), significantly outperforming both single-model approaches and existing hybrid frameworks. Our theoretical analysis provides new insights into artifact persistence across deepfake generation techniques, and our adaptive computational scaling enables efficient deployment across diverse computational environments. The proposed framework advances the state of the art in deepfake detection by bridging the gap between spatial and temporal analysis while maintaining interpretability through novel visualization techniques.
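The abstract does not specify how the Cross-Modal Attention Fusion mechanism combines the two backbone streams. As an illustration only, the following is a minimal NumPy sketch of one plausible design: features from one stream attend over features from the other via scaled dot-product attention, and the attended context is concatenated back onto the query stream. All names (`cross_modal_attention_fusion`, the projection matrices `W_q`, `W_k`, `W_v`) and the stand-in feature shapes are hypothetical, not taken from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention_fusion(feat_a, feat_b, W_q, W_k, W_v):
    """Hypothetical fusion: queries from stream A attend to keys/values
    of stream B; the cross-modal context is concatenated onto A."""
    q = feat_a @ W_q                       # (n_a, d)
    k = feat_b @ W_k                       # (n_b, d)
    v = feat_b @ W_v                       # (n_b, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])  # (n_a, n_b)
    attn = softmax(scores, axis=-1)          # each row sums to 1
    attended = attn @ v                      # (n_a, d)
    return np.concatenate([feat_a, attended], axis=-1)  # (n_a, 2d)

rng = np.random.default_rng(0)
d = 8
feat_eff = rng.normal(size=(4, d))  # stand-in for EfficientNetV2L features
feat_xcp = rng.normal(size=(6, d))  # stand-in for XceptionNet features
W_q, W_k, W_v = (rng.normal(size=(d, d)) for _ in range(3))
fused = cross_modal_attention_fusion(feat_eff, feat_xcp, W_q, W_k, W_v)
print(fused.shape)  # (4, 16)
```

In practice the projections would be learned jointly with the rest of the network; the concatenation here is just one way to realize "dynamic integration" of complementary features.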