Deep Learning for Blood Glucose Prediction: Reproducibility Challenges and Factors Affecting Differential Performance

Temiloluwa Prioleau
Baiying Lu
Biratal Wagle
Yanjun Cui

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Blood glucose prediction is a fundamental part of advanced technology that promises to improve diabetes outcomes. However, a critical gap exists around understanding the reproducibility of state-of-the-art methods for blood glucose prediction. In this study, we curated 60 deep learning (DL)-based glucose prediction papers published between 2018–2025 and assessed them against seven established reproducibility criteria. We found that code availability, overreliance on a single public dataset, and limited use of multiple datasets for algorithm development and evaluation are amongst the top challenges to reproducibility. Next, we replicated six representative models from well-cited prior literature using 4602 days of data with over 1.25 million glucose samples from 128 persons with type 1 diabetes across three public datasets - OhioT1DM, DiaTrend, and T1DEXI. Our results show good reproducibility of DL methods when using the same code (where available) and same evaluation dataset. However, we found poor conceptual reproducibility across datasets with significantly different diabetes management. Further analyses revealed that the accuracy of blood glucose prediction methods was significantly associated with individual diabetes management and sex/gender. All models had significantly higher prediction errors for individuals with worse glycemic control and for female subgroups compared to males. To accelerate development of robust and equitable algorithms for diabetes management, we conclude with recommendations for future researchers centered considerations for data selection, model design and selection, model evaluation and reporting results, documentation and code release.

Version published to 10.21203/rs.3.rs-7024301/v1 on Research Square
Jul 21, 2025

Comparative Evaluation of Machine Learning and Deep Learning Models for Blood Glucose Prediction on the OhioT1DM Dataset

This article has 10 authors:
1. Taofiq Olanrewaju MUSA
2. Arsene ADJEVI
3. Donaldo Omondi JACCOJWANG
4. Nasirudeen ADELEYE
5. Diyaolu Abdulmalik OPEYEMI
6. Süleyman UZUN
7. Mustafa Zahid YILDIZ
8. Ali LAZIM
9. Rhobi Peter
10. Selçuk YAYLACI
This article has no evaluationsLatest version Aug 21, 2025
Assumption-Agnostic Deep Learning Framework for Holistic Clinical Trial Monitoring

This article has 3 authors:
1. Shaoming Yin
2. Zheyang Wu
3. Jianchang Lin
This article has no evaluationsLatest version Aug 1, 2025
WITHDRAWN: Using Machine Learning to Improve Cancer Diagnosis Accuracy Through Genetic Data Analysis

This article has 4 authors:
1. Bassam Elzaghmouri
2. Marwan Abo zanoneh
3. Feras Fares AL-Mashakbah
4. Saad Mamoun AbdelRahman Ahmed
This article has no evaluationsLatest version Jul 18, 2025

Listed in

Abstract

Article activity feed

Related articles

Comparative Evaluation of Machine Learning and Deep Learning Models for Blood Glucose Prediction on the OhioT1DM Dataset

Assumption-Agnostic Deep Learning Framework for Holistic Clinical Trial Monitoring

WITHDRAWN: Using Machine Learning to Improve Cancer Diagnosis Accuracy Through Genetic Data Analysis