Multi-Domain Feature Enhancement and Fusion Transformer with Bilateral Facial Structure Awareness for Robust and Cross-Domain Facial Expression Recognition

Abstract

Facial expression recognition (FER) is essential for affective computing and human–computer interaction, but robust performance under unconstrained conditions (illumination, pose, occlusion, cultural diversity) remains difficult to achieve. Traditional CNNs capture local detail but struggle to model global dependencies, while Vision Transformers (ViTs) capture global context yet often overlook the fine-grained texture and frequency cues that are crucial for discriminating subtle expressions. To address these issues, we propose a unified Multi-Domain Feature Enhancement and Fusion (MDFEF) framework that combines a ViT-based global encoder with channel, spatial, and frequency branches for complementary feature learning. Taking into account the approximate bilateral symmetry of human faces and the asymmetric distortions introduced by pose, occlusion, and illumination, MDFEF learns symmetry-aware and asymmetry-robust representations for facial expression recognition across diverse domains. An adaptive Cross-Domain Feature Enhancement and Fusion (CDFEF) module further aligns and integrates these heterogeneous features, yielding domain-consistent and illumination-robust expression understanding. Experiments on KDEF, FER2013, and RAF-DB show that the proposed model outperforms representative CNN-, Transformer-, and ensemble-based baselines in both accuracy and F1-score, confirming its effectiveness and strong generalization for real-world FER.
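To make the multi-branch design concrete, the sketch below illustrates how a backbone feature map could be refined by parallel channel, spatial, and frequency branches and then fused. It is a minimal PyTorch sketch, assuming SE-style channel attention, a convolutional spatial attention map, an FFT-magnitude frequency branch, and concatenation followed by a 1x1 convolution as the fusion rule; the paper's actual MDFEF and CDFEF modules are not specified here and may differ.

# Minimal PyTorch sketch of a multi-branch feature-enhancement design in the
# spirit of MDFEF. All branch definitions, dimensions, and the fusion rule are
# illustrative assumptions, not the authors' implementation.
import torch
import torch.nn as nn


class ChannelBranch(nn.Module):
    """Squeeze-and-excitation-style channel reweighting (assumed design)."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):                      # x: (B, C, H, W)
        w = self.fc(x.mean(dim=(2, 3)))        # global average pool -> (B, C)
        return x * w[:, :, None, None]         # reweight channels


class SpatialBranch(nn.Module):
    """Single-channel spatial attention map (assumed design)."""
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, 1, kernel_size=7, padding=3)

    def forward(self, x):
        return x * torch.sigmoid(self.conv(x))


class FrequencyBranch(nn.Module):
    """Frequency cue from the 2-D FFT magnitude of the features (assumed design)."""
    def __init__(self, channels):
        super().__init__()
        self.proj = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x):
        mag = torch.fft.fft2(x, norm="ortho").abs()   # spectral magnitude
        return self.proj(mag)


class MultiBranchFusion(nn.Module):
    """Concatenate branch outputs and fuse with a 1x1 conv (assumed rule)."""
    def __init__(self, channels, num_classes=7):
        super().__init__()
        self.channel = ChannelBranch(channels)
        self.spatial = SpatialBranch(channels)
        self.freq = FrequencyBranch(channels)
        self.fuse = nn.Conv2d(3 * channels, channels, kernel_size=1)
        self.head = nn.Linear(channels, num_classes)

    def forward(self, x):                      # x: backbone feature map (B, C, H, W)
        f = torch.cat([self.channel(x), self.spatial(x), self.freq(x)], dim=1)
        f = self.fuse(f).mean(dim=(2, 3))      # fuse, then global average pool
        return self.head(f)


if __name__ == "__main__":
    feats = torch.randn(2, 64, 14, 14)         # e.g. ViT patch tokens reshaped to a grid
    logits = MultiBranchFusion(64)(feats)
    print(logits.shape)                        # torch.Size([2, 7])

In this sketch the three branches operate on the same backbone features and are merged by concatenation; the paper's adaptive CDFEF module presumably replaces this fixed fusion with a learned, domain-aware alignment.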
