<span style="font-size: 12.0pt; mso-bidi-font-size: 10.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; color: black; mso-ansi-language: EN-US; mso-fareast-language: DE; mso-bidi-language: AR-SA;">Explainable Machine Learning in the Prediction of Depression

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background: Depression constitutes a major public health issue, being one of the leading causes of burden of disease worldwide. The risk of depression is determined by both genetic and environmental factors. While genetic factors cannot be altered, the identification of potentially reversible environmental factors is crucial in order to try and limit the prevalence of depression. Aim: A cross-sectional questionnaire-based study on a sample from the multicultural region of Thrace in northeast Greece was designed to assess the potential association of depression with several socio-demographic characteristics, lifestyle and health status. The study employed four machine learning (ML) methods to assess depression: Logistic Regression (LR), Support Vector Machine (SVM), XGBoost, and Neural Networks (NNs). These models were compared to identify the best-performing approach. Additionally, a Genetic Algorithm (GA) was utilized for feature selection and SHAP (SHapley Additive exPlanations) for interpreting the contributions of each employed feature. Results: The XGBoost classifier demonstrated the highest performance on the test dataset to predict depression with excellent accuracy (97.83%), with NNs a second close (accuracy, 97.02%). The XGBoost classifier utilized the 15 most significant risk factors identified by the GA algorithm. Additionally, the SHAP analysis revealed that anxiety, education level, alcohol consumption and body mass index were the most influential predictors of depression. Conclusions: These findings provide valuable insights for the development of personalized public health interventions and clinical strategies, ultimately promoting improved mental well-being for individuals. Future research should expand datasets to enhance model accuracy, enabling early detection and personalized mental healthcare systems for better intervention.

Article activity feed