Predictive Modelling of Linear Growth Faltering Among Pediatric Patients with Diarrhea in Rural Western Kenya: An Explainable Machine Learning Approach

Billy Ogwel
Vincent H. Mzazi
Alex O. Awuor
Caleb Okonji
Raphael O. Anyango
Caren Oreso
John B. Ochieng
Stephen Munga
Dilruba Nasrin
Kirkby D. Tickell
Patricia B. Pavlinac
Karen L. Kotloff
Richard Omore

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Introduction: Stunting affects one-fifth of children globally with diarrhea accounting for an estimated 13.5% of stunting. Identifying risk factors for its precursor, linear growth faltering (LGF), is critical to designing interventions. Moreover, developing new predictive models for LGF using more recent data offers opportunity to improve model performance and capture new insights. We employed machine learning (ML) to derive and validate a predictive model for LGF among children enrolled with diarrhea in the Vaccine Impact on Diarrhea in Africa (VIDA) study and the Enterics for Global Heath (EFGH) ― Shigella study in rural western Kenya. Methods We used 7 ML algorithms to retrospectively build prognostic models for the prediction of LGF (≥ 0.5 decrease in height/length for age z-score [HAZ]) among children 6–35 months. We used de-identified data from the VIDA study (n = 1,473) combined with synthetic data (n = 8,894) in model development, which entailed split-sampling and K-fold cross-validation with over-sampling technique, and data from EFGH-Shigella study (n = 655) for temporal validation. Potential predictors included demographic, household-level characteristics, illness history, anthropometric and clinical data chosen using an explainable model agnostic approach. The champion model was determined based on the area under the curve (AUC) metric. Results The prevalence of LGF in the development and temporal validation cohorts was 187 (16.9%) and 147 (22.4%), respectively. The following variables were associated with LGF in decreasing order: age (16.6%), temperature (6.0%), respiratory rate (4.1%), SAM (3.4%), rotavirus vaccination (3.3%), breastfeeding (3.3%), and skin turgor (2.1%). While all models showed good prediction capability, the gradient boosting model achieved the best performance (AUC% [95% Confidence Interval]: 83.5 [81.6–85.4] and 65.6 [60.8–70.4] on the development and temporal validation datasets, respectively). Conclusion Our findings accentuates the enduring relevance of established predictors of LGF whilst demonstrating the practical utility of ML algorithms for rapid identification of at-risk children.

Version published to 10.21203/rs.3.rs-4047381/v1 on Research Square
Mar 15, 2024

Derivation and Validation of a Clinical Predictive Model for Longer Duration Diarrhea among Pediatric Patients in Kenya using Machine Learning Algorithms

This article has 13 authors:
1. Billy Ogwel
2. Vincent Mzazi
3. Alex O. Awuor
4. Caleb Okonji
5. Raphael O. Anyango
6. Caren Oreso
7. John B. Ochieng
8. Stephen Munga
9. Dilruba Nasrin
10. Kirkby D. Tickell
11. Patricia B. Pavlinac
12. Karen L. Kotloff
13. Richard Omore
This article has no evaluationsLatest version Mar 15, 2024
Assessing Screening Methods and Machine Learning for Predicting Childhood Overweight and Obesity: A Population-Based Study

This article has 7 authors:
1. Irit Lior-Sadaka
2. Shahar Melamed
3. Itamar Grotto
4. Yair Sadaka
5. Roni Eilenberg
6. Moshe Uziel
7. Dan Greenberg
This article has no evaluationsLatest version Apr 11, 2024
Early Prognosis Prediction for Non-variceal Upper Gastrointestinal Bleeding in the Intensive Care Unit: Based on Interpretable Machine Learning

This article has 7 authors:
1. Xiaoxu Zhao
2. Shuxing Wei
3. Yujie Pan
4. Kunlong Qu
5. Guanghao Yan
6. Xiya Wang
7. Yuguo Song
This article has no evaluationsLatest version Mar 28, 2024

Listed in

Abstract

Article activity feed

Related articles

Derivation and Validation of a Clinical Predictive Model for Longer Duration Diarrhea among Pediatric Patients in Kenya using Machine Learning Algorithms

Assessing Screening Methods and Machine Learning for Predicting Childhood Overweight and Obesity: A Population-Based Study

Early Prognosis Prediction for Non-variceal Upper Gastrointestinal Bleeding in the Intensive Care Unit: Based on Interpretable Machine Learning