Benchmarking Imputation Strategies for Missing Time-Series Data in Critical Care Using Real-World-Inspired Scenarios

Michael Poette
Sandrine Mouysset
Daniel Ruiz
Jean-Marc Alliot
Vincent Pey
Vincent Minville

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Handling missing data remains a central issue in ICU time-series analysis, where data gaps often stem from non-random factors such as sensor disconnections or clinical workflows. In this study, we systematically benchmarked several imputation strategies using monitoring data from MIMIC-IV and designed masking scenarios that reflect common ICU patterns, including random dropouts, temporary monitoring interruptions, and sensor-specific failures. We compared simple statistical approaches (mean, LOCF, interpolation), classical machine learning techniques (MICE, MissForest), and deep learning models (SAITS, BRITS, US-GAN, GP-VAE). SAITS, based on Transformer architecture, achieved the best performance in most settings. However, linear interpolation—despite its simplicity—yielded robust estimates in short univariate gaps and occasionally performed comparably to neural models. Our findings suggest that while deep learning methods improve overall imputation accuracy, simpler and more interpretable approaches may be sufficient for many ICU applications. This work introduces a practical framework for evaluating time-series imputation strategies under realistic constraints, with a focus on clinical relevance and operational deployability.

Version published to 10.21203/rs.3.rs-6973012/v1 on Research Square
Jul 2, 2025

Generative AI-Based Imputation to Preserve Data Fidelity and Enhance Outcome Prediction: A Multi-Institutional Study in Cardiac Surgery

This article has 11 authors:
1. Negin Maddah
2. Amin Ramezani
3. Qingchu Jin
4. Jakob Wollborn
5. Akinobu Itoh
6. Jaime B. Rabb
7. Felistas Mazhude
8. Robert S. Kramer
9. Douglas B. Sawyer
10. Raimond L. Winslow
11. Farhad R. Nezami
This article has no evaluationsLatest version Jan 23, 2026
A Framework for Locally Imputing and Predicting Biomarker Trajectories Under Irregular Monitoring: Application to Chronic Myeloid Leukemia

This article has 6 authors:
1. Felipe Montano-Campos
2. Patrick Heagerty
3. Eric Haupt
4. Erin Hahn
5. Jerald Radich
6. Aasthaa Bansal
This article has no evaluationsLatest version Jan 7, 2026
ICU Mortality and LOS Prediction Models Using MachineLearning Based on Both Real and Simulated Data

This article has 3 authors:
1. Girma Neshir Alemneh
2. Hirut Bekele Ashagrie
3. Lemlem Kassa Tegegne
This article has no evaluationsLatest version Jan 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Generative AI-Based Imputation to Preserve Data Fidelity and Enhance Outcome Prediction: A Multi-Institutional Study in Cardiac Surgery

A Framework for Locally Imputing and Predicting Biomarker Trajectories Under Irregular Monitoring: Application to Chronic Myeloid Leukemia

ICU Mortality and LOS Prediction Models Using MachineLearning Based on Both Real and Simulated Data