Machine Learning Prediction of Educational Disparities in Somaliland with Algorithmic Fairness Evaluation

Jibril Abdikadir Ali
Mustafe Khadar Abdi
Abdisalam Hassan Muse
Mukhtar Axmed Cumar

Abstract

This study, "Predicting Educational Disparity in Somaliland: A Machine Learning Analysis and Algorithmic Fairness Audit," aims to identify the primary drivers of educational disparity and to assess the equity of a predictive model in a fragile, post-conflict context. Using microdata from the 2020 Somaliland Health and Demographic Survey (SLHDS) for a sample of 17,686 women, we developed a Random Forest classification model to predict school attendance. The model showed excellent predictive performance, with an area under the receiver operating characteristic curve (ROC-AUC) of 0.988, and identified spousal education and household wealth as the most important predictors. However, a subsequent algorithmic fairness audit revealed a critical Equal Opportunity gap of 18.3 percentage points between the best- and worst-performing regions, indicating that the model is significantly less effective for women in the Sool region. This finding demonstrates that even highly accurate models can perpetuate systemic inequities. For policy, this implies that deploying AI tools without rigorous fairness evaluation risks exacerbating marginalization; fairness audits must therefore be a mandatory component of data-driven policymaking in fragile states so that interventions are both effective and equitable.
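
The abstract does not say how the Equal Opportunity gap was computed. A common definition, assumed here, is the difference in true positive rate (the share of actual positives the model correctly flags) between the best- and worst-served groups. The sketch below illustrates that calculation in plain Python; the function names, the group labels `region_a` and `region_b`, and the toy labels are all hypothetical and are not drawn from the SLHDS data or the authors' code.

```python
from collections import defaultdict


def true_positive_rates(y_true, y_pred, groups):
    """Return {group: TPR} for every group with at least one actual positive."""
    true_pos = defaultdict(int)   # actual positives predicted as positive
    positives = defaultdict(int)  # all actual positives
    for actual, predicted, group in zip(y_true, y_pred, groups):
        if actual == 1:
            positives[group] += 1
            if predicted == 1:
                true_pos[group] += 1
    return {g: true_pos[g] / positives[g] for g in positives}


def equal_opportunity_gap(y_true, y_pred, groups):
    """Largest minus smallest group TPR (assumed definition of the gap)."""
    rates = true_positive_rates(y_true, y_pred, groups)
    return max(rates.values()) - min(rates.values())


# Toy example with invented labels: 1 = attended school, 0 = did not.
y_true = [1, 1, 1, 1, 0, 1, 1, 1, 1, 0]
y_pred = [1, 1, 1, 1, 0, 1, 1, 0, 0, 1]
groups = ["region_a"] * 5 + ["region_b"] * 5

rates = true_positive_rates(y_true, y_pred, groups)
gap = equal_opportunity_gap(y_true, y_pred, groups)
print(rates)
print(f"Equal Opportunity gap: {gap * 100:.1f} percentage points")
```

In this toy case the model catches every actual positive in one group but only half in the other, even though overall accuracy looks reasonable. That is the kind of regional disparity an aggregate score such as ROC-AUC can hide.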

Version published to 10.21203/rs.3.rs-9031905/v1 on Research Square
Mar 26, 2026

Related articles

Bias in School-Based Risk Prediction: Challenges for Equitable Practice

This article has 2 authors:
1. Adam Lockwood
2. Celeste Malone
This article has no evaluations. Latest version: Apr 8, 2026

Algorithmic Disparities in Data-Driven Decision Systems: An Empirical Evaluation of Group-Level Error and Calibration Differences

This article has 1 author:
1. Anishka Paharia
This article has no evaluations. Latest version: Mar 30, 2026

Development and interpretability analysis of a risk prediction model for Problematic Internet Use in adolescents

This article has 11 authors:
1. Rongmei Liu
2. Saiyi Wang
3. Clifford Silver Tarimo
4. Quanman Li
5. Qiurui Yu
6. Yifei Feng
7. Lipei Zhao
8. Shuaibin Liu
9. Xinghan Chen
10. Jian Wu
11. Qiuping Zhao
This article has no evaluations. Latest version: Apr 8, 2026
