Personality Recognition Models Based on Visual Cues: Exploring the Intrinsic Links between PAD Dynamics and Personality Traits

Abstract

Understanding human personality from nonverbal behavior is a longstanding challenge in psychology and artificial intelligence. This study presents a video-based, automated framework for personality recognition that integrates dynamic visual cues with the PAD (Pleasure-Arousal-Dominance) emotional state model. We introduce the Cross-Modal Attention Vision Transformer (CMA-ViT), a dual-stream, multi-task learning model that fuses raw video frames with pre-extracted features, including facial action units, head motion, gaze, and frame-by-frame PAD values. The model captures temporal dynamics in emotional expression, head motion, and gaze patterns to infer the Big Five personality traits.

Experiments on the MDPE dataset demonstrate robust performance, with an average classification accuracy of 71.7%, highest for Neuroticism (90.1%) and lowest for Openness (57.4%), suggesting that not all personality traits are explicitly expressed in observable behavior. Gradient-weighted feature importance analysis revealed that PAD emotional features, gaze patterns, and head-related cues are the primary contributors, while facial action units introduced noise in this dataset. Temporal analysis of PAD fluctuations further indicated that indices such as variability, frequency, intensity, and transition rate carry trait-relevant signals, supporting the notion that personality is reflected not only in average emotional states but also in dynamic patterns of emotional change.

These findings have methodological and theoretical implications: they highlight the value of integrating multi-dimensional information (such as temporal emotional dynamics, head motion, and gaze) for accurate personality recognition, challenge assumptions about the predictive role of facial actions, and empirically support dynamic models of personality such as Fleeson's Density Distribution Theory. This work provides a novel, interpretable framework for video-based personality computing, advancing both the accuracy and theoretical grounding of automated trait inference.
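To make the temporal PAD indices mentioned above concrete, the following is a minimal illustrative sketch of how variability, frequency, intensity, and transition rate could be computed from a frame-by-frame series for one PAD dimension. The function name, the direction-change definition of frequency, and the `delta` threshold are assumptions for illustration, not the paper's actual implementation.

```python
from statistics import mean, stdev

def pad_dynamics(series, delta=0.1):
    """Illustrative temporal indices for one PAD dimension
    (e.g. frame-by-frame pleasure values in [-1, 1]).
    All definitions here are assumptions, not taken from the paper."""
    diffs = [b - a for a, b in zip(series, series[1:])]
    variability = stdev(series)               # spread of emotional states
    intensity = mean(abs(v) for v in series)  # mean absolute level
    # frequency: number of direction reversals in the signal
    signs = [1 if d > 0 else -1 for d in diffs if d != 0]
    frequency = sum(1 for a, b in zip(signs, signs[1:]) if a != b)
    # transition rate: fraction of frame-to-frame changes larger than delta
    transition_rate = sum(1 for d in diffs if abs(d) > delta) / len(diffs)
    return {"variability": variability, "intensity": intensity,
            "frequency": frequency, "transition_rate": transition_rate}

# Hypothetical per-frame pleasure values for one clip
pleasure = [0.2, 0.5, 0.4, 0.7, 0.6, 0.1, 0.3]
print(pad_dynamics(pleasure))
```

Per-dimension indices like these would then be aggregated per video and fed to the trait classifier alongside the other visual features.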
