Facial Action Units Detection Using Temporal Context and Feature Reassignment

Sipeng Yang
Hongyu Huang
Ying Sophie Huang
Xiaogang Jin

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Facial action units (AUs) encode the activations of facial muscle groups, playing a crucial role in expression analysis and facial animation. However, current deep learning AU detection methods primarily focus on single-image analysis, which limits the exploitation of rich temporal context for robust outcomes. Moreover, the scale of available datasets remains limited, leading models trained on these datasets to tend to suffer from overfitting issues. This paper proposes a novel AU detection method integrating spatial and temporal data with inter-subject feature reassignment for accurate and robust AU predictions. Our method first extracts regional features from facial images. Then, to effectively capture both the temporal context and identity-independent features, we introduce a Temporal feature Combination and Feature Reassignment (TC&FR) module, which transforms single-image features into a cohesive temporal sequence and fuses features across multiple subjects. This transformation encourages the model to utilize identity-independent features and temporal context, thus ensuring robust prediction outcomes. Experimental results demonstrate the enhancements brought by the proposed modules and the state-of-the-art (SOTA) results achieved by our method.

Version published to 10.22541/au.171843047.76223460/v1
Jun 15, 2024

ELEGANT: Combining Simultaneous Node and Edge Generation with Landmark Multi-Task Learning for Facial Action Unit Recognition

This article has 2 authors:
1. Andrew Sumsion
2. Dah-Jye Lee
This article has no evaluationsLatest version Feb 11, 2026
Efficient Unsupervised Clustering of Facial Geometry and Head Orientation Using 2D Landmarks

This article has 5 authors:
1. Vineet Kumar Rakesh
2. Amitabha Das
3. Tapas Samanta
4. Sarbajit Pal
5. Hemendra Kumar Pandey
This article has no evaluationsLatest version Feb 11, 2026
A Dual-Modality Spatio-Temporal and Frequency Framework for Robust Deepfake Detection

This article has 7 authors:
1. Arman Sajjadi
2. Sayna Sarvar
3. Mobin Nekou
4. Mahdi Fallah
5. Delaram Mehralizadeh
6. Mohammad Hossein Jabbarzadeh
7. Pedram Salehpour
This article has no evaluationsLatest version Feb 27, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

ELEGANT: Combining Simultaneous Node and Edge Generation with Landmark Multi-Task Learning for Facial Action Unit Recognition

Efficient Unsupervised Clustering of Facial Geometry and Head Orientation Using 2D Landmarks

A Dual-Modality Spatio-Temporal and Frequency Framework for Robust Deepfake Detection