A Position-Aware Multi-Head Self-Attention Model for Student Performance Prediction
Abstract
Student performance prediction is a central problem in educational data mining and learning analytics, aiming to build generalizable and interpretable models from students' historical learning-process data to support personalized instruction and early academic warning. However, educational data are often high-dimensional and strongly temporal, with complex feature interactions, making it challenging for conventional regression approaches to jointly capture temporal regularities and nonlinear dependencies. To address this issue, we propose PAM-MLP, a student performance prediction model that integrates Position-Aware Attention (PAA) and Multi-head Self-Attention (MSA). The PAA module incorporates learnable positional encodings to capture stage-wise and periodic patterns in learning trajectories, and adopts gated scaled dot-product attention to dynamically adjust the importance of different time steps. The MSA module models feature dependencies from multiple perspectives, enhanced by adaptive head weighting and a non-uniform attention distribution strategy to better characterize heterogeneous learning behaviors. On top of the attention-based representations, a multi-layer perceptron captures higher-order nonlinear interactions and improves regression fitting. Experimental results show that PAM-MLP consistently outperforms competitive regression baselines, reducing MAE and RMSE by 9% and 11%, respectively, and improving R² by 10%, demonstrating its effectiveness and robustness for student performance prediction in educational settings.
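The core of the PAA module described above can be illustrated with a minimal NumPy sketch. This is a simplified, single-head illustration under our own assumptions, not the authors' implementation: the positional encodings `P` and projection matrices are fixed random arrays here (the paper's are learnable), and the gating mechanism is assumed to be a sigmoid gate applied per time step after scaled dot-product attention.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gated_positional_attention(X, P, Wq, Wk, Wv, Wg):
    """Single-head position-aware gated attention (illustrative sketch).

    X:  (T, d) sequence of per-time-step learning features
    P:  (T, d) positional encodings (learnable in the paper; fixed here)
    Wq, Wk, Wv, Wg: (d, d) projection matrices (hypothetical names)
    Returns the gated attended output (T, d) and attention weights (T, T).
    """
    H = X + P                                   # inject positional information
    Q, K, V = H @ Wq, H @ Wk, H @ Wv            # query/key/value projections
    scores = Q @ K.T / np.sqrt(Q.shape[-1])     # scaled dot-product scores
    A = softmax(scores, axis=-1)                # attention over time steps
    gate = 1.0 / (1.0 + np.exp(-(H @ Wg)))      # sigmoid gate in (0, 1)
    return gate * (A @ V), A                    # gate modulates step importance

# Toy usage: 5 time steps, 8-dimensional features
rng = np.random.default_rng(0)
T, d = 5, 8
X = rng.standard_normal((T, d))
P = rng.standard_normal((T, d)) * 0.1
Wq, Wk, Wv, Wg = (rng.standard_normal((d, d)) * 0.3 for _ in range(4))
out, attn = gated_positional_attention(X, P, Wq, Wk, Wv, Wg)
```

In a full model, the output of this module would be fed through multiple such heads with adaptive weighting (the MSA stage) and then into an MLP regressor; the sketch only shows how positional information and gating combine with standard scaled dot-product attention.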