Interpretable Machine Learning for Life Expectancy Prediction: A Comparative Study of Linear Regression, Decision Tree, and Random Forest

Roman Dolgopolyi
Ioanna Amaslidou
Agrippina Margaritou

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Life expectancy is a fundamental indicator of population health and socio-economic well-being, yet accurately forecasting it remains challenging due to the interplay of demographic, environmental, and healthcare factors. This study evaluates three machine learning models—Linear Regression (LR), Re- gression Decision Tree (RDT), and Random Forest (RF), using a real-world da- taset drawn from World Health Organization (WHO) and United Nations (UN) sources. After extensive preprocessing to address missing values and inconsist- encies, each model’s performance was assessed with R2, Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE). Results show that RF achieves the highest predictive accuracy (R2 = 0.9423), significantly outperforming LR and RDT. Interpretability was prioritized through p-values for LR and feature- importance metrics for the tree-based models, revealing immunization rates (diphtheria, measles) and demographic attributes (HIV/AIDS, adult mortality) as critical drivers of life-expectancy predictions. These insights underscore the syn- ergy between ensemble methods and transparency in addressing public-health challenges. Future research should explore advanced imputation strategies, alter- native algorithms (e.g., neural networks), and updated data to further refine pre- dictive accuracy and support evidence-based policymaking in global health con- texts.

Version published to 10.21203/rs.3.rs-6968809/v1 on Research Square
Jun 26, 2025

SHAP-LR: An Interpretable Logistic Regression Model for Coronary Heart Disease Risk Prediction

This article has 3 authors:
1. Peihua Tong
2. Hui Hu
3. Ling Tong
This article has no evaluationsLatest version Jun 9, 2025
Interpretable Machine Learning for Mortality Risk Detection in National Health Data

This article has 4 authors:
1. J. CHA
2. E.D. CHA
3. E. Yoo
4. H. Song
This article has no evaluationsLatest version Jun 20, 2025
Enhancing Mental Health Decision-Making with Artificial Intelligence/Machine Learning: A Prescriptive Analytics Approach for Customised Outcomes

This article has 7 authors:
1. Mark Payne
2. Fareed Ud Din
3. Kabir Sattarshetty
4. Cassandra Sundaraja
5. Anwaar Ul-Haq
6. Theresa Scott
7. Niusha Shafiabady
This article has no evaluationsLatest version Jun 17, 2025

Listed in

Abstract

Article activity feed

Related articles

SHAP-LR: An Interpretable Logistic Regression Model for Coronary Heart Disease Risk Prediction

Interpretable Machine Learning for Mortality Risk Detection in National Health Data

Enhancing Mental Health Decision-Making with Artificial Intelligence/Machine Learning: A Prescriptive Analytics Approach for Customised Outcomes