ELEGANT: Combining Simultaneous Node and Edge Generation with Landmark Multi-Task Learning for Facial Action Unit Recognition

Andrew Sumsion
Dah-Jye Lee

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Facial Action Units (AUs) have recently been used in dementia detection, pain detection, talking head generation, and even facial reconstruction tasks. The success of each of these applications is at least partially due to the performance of the underlying AU recognition model. The commonly reported metric for comparing AU recognition is the average F1 score. Improving the average F1 score for AU recognition will directly improve the performance of each application that requires AU recognition. To improve the average F1 score, we propose simultaneously generating the nodes and edges for the graph neural network, strategically using landmark data, present additional AU recognition multi-task learning methods, and introduce ensemble learning to AU recognition. Although most current solutions for AU recognition generate the nodes and the edges separately, our proposed method demonstrates the improvement that comes from simultaneously generating the nodes and edges. In addition, our method proposes to use the available landmark data in a multi-task learning method. Our solution also applies ensemble learning to AU recognition. Through extensive experimentation, we demonstrate an improvement in the state-of-the-art average F1 score from 66.3 to 67.3 and from 66.9 to 67.8 on the BP4D and DISFA datasets, a considerable improvement in this field. These results underscore the substantial improvements our proposed method brings to the application of AU recognition.

Version published to 10.21203/rs.3.rs-7660029/v1 on Research Square
Feb 11, 2026

A Deep Siamese ResNet-50 Framework with Triplet loss for High-Precision Face Verification

This article has 5 authors:
1. Phan Thi Huong
2. Huynh Cao Tuan
3. Nguyen Minh Son
4. Tran Tay
5. Thanh Q. Nguyen
This article has no evaluationsLatest version Feb 24, 2026
Acoustic Feature Synergy and Self-Supervised Learning for Robust Tabla Stroke Classification

This article has 4 authors:
1. Jaipreet Kaur
2. Rajdeep Singh Sohal
3. Manbir Kaur
4. Satinder Kaur
This article has no evaluationsLatest version Mar 24, 2026
A Comprehensive Review in Unimodal and Multimodal Emotion Recognition

This article has 39 authors:
1. Jiachen Luo
2. Qu Yang
3. Jiajun He
4. Yining Hua
5. Zheng Lian
6. Yuanchao Li
7. Siyang Song
8. Wen Wu
9. Dingdong Wang
10. Shuai Shen
11. Jingyao Wu
12. Guimin Hu
13. He Hu
14. Yong Li
15. Zixing Zhang
16. Jiadong Wang
17. Sifan Zhou
18. Zuojin Tang
19. Canran Xiao
20. Sheng Xu
21. Zhenjun Zhao
22. Xiangyang Xue
23. Sicheng Zhao
24. Yong Dai
25. Tomoki Toda
26. Licai Sun
27. Kailai Yang
28. Liyun Zhang
29. Cong Cai
30. Jiamin Du
31. Ziyang Ma
32. Mingjie Chen
33. Chengxuan Qian
34. Zhenlong Yuan
35. Xie Chen
36. Huy Phan
37. Lin Wang
38. Björn Schuller
39. Joshua Reiss
This article has no evaluationsLatest version Mar 30, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Deep Siamese ResNet-50 Framework with Triplet loss for High-Precision Face Verification

Acoustic Feature Synergy and Self-Supervised Learning for Robust Tabla Stroke Classification

A Comprehensive Review in Unimodal and Multimodal Emotion Recognition