Who Fails and Why: An Analysis of Student Trajectories and the Prediction of Undergraduate Performance in Programming Courses

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

This study examines failure in introductory programming courses (CS1) in Chilean higher education by combining an analysis of academic trajectories with early-risk prediction models. We analyzed a cohort of 994 students from a Chilean technical uni-versity (2025–1), with a 46% failure rate, integrating pre-university academic and ad-mission variables (e.g., mathematics and language indicators, as well as baseline diag-nostic measures when available), sociodemographic information, and within-semester performance indicators. Group differences were assessed using non-parametric tests, and predictive performance was evaluated under two realistic information-availability scenarios: (i) pre-university variables only and (ii) variables available up to the first major written examination (C1). The results show statistically significant differences between students who passed and those who failed, with indicators of quantitative preparedness and, most notably, C1 performance emerging as the strongest signals of risk. In the pre-university scenario, models achieved acceptable discrimination (AUC ≈ 0.77), whereas incorporating C1 substantially improved discriminative performance (AUC ≈ 0.92) and increased precision in identifying at-risk students while reducing false positives. These findings support a staged institutional strategy: broad, low-cost pre-ventive support before the semester begins, followed by more targeted and intensive interventions after C1, thereby enabling more efficient early-warning systems in high-stakes first-year courses.

Article activity feed