Evaluating the Predictive Accuracy of Deep Learning Algorithms for Postoperative Mortality in Cardiac Surgery: A Systematic Review and Meta-Analysis
Abstract
Background: Risk stratification in cardiac surgery has long depended on logistic regression models built from a fixed set of preoperative variables, an approach that, while extensively validated, cannot capture the complexity of real patient physiology. Deep learning (DL) offers a fundamentally different paradigm, one capable of detecting non-linear interactions across high-dimensional datasets. We conducted this systematic review and meta-analysis to quantify whether that theoretical advantage translates into measurably better prediction of postoperative mortality after cardiac surgery.

Methods: We searched PubMed/MEDLINE, Embase, and IEEE Xplore following PRISMA 2020 and Cochrane Prognosis Methods Group guidelines. Eligible studies directly compared DL architectures against established risk scores, namely EuroSCORE II or STS-PROM, for short-term mortality in adult cardiac surgery populations. Methodological quality was assessed with PROBAST+AI. Because raw AUC values are bounded and violate the normality assumptions required for standard pooling, all estimates were logit-transformed prior to meta-analysis using a restricted maximum likelihood (REML) random-effects model.

Results: Six studies met the inclusion criteria, representing 250,560 patients across markedly different clinical settings. Deep learning models achieved a pooled AUC of 0.856 (95% CI: 0.774–0.913). This came with a caveat: between-study heterogeneity was substantial (I² = 91.3%), reflecting the diversity of architectures, cohort sizes, and institutional contexts included. Traditional risk scores yielded a pooled AUC of 0.815 (95% CI: 0.754–0.864; I² = 77.9%).

Conclusion: DL models outperform conventional risk scores on discrimination. The gap, however, sits alongside serious unresolved questions: heterogeneity is high, calibration data are largely absent from the primary literature, and most evidence comes from retrospective single-centre cohorts.
Standardized reporting frameworks are a prerequisite, not a recommendation, before these models enter routine clinical practice.
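As a minimal sketch of the pooling approach described in the Methods, the snippet below logit-transforms per-study AUCs (using the delta method for standard errors), fits a random-effects model, and back-transforms the pooled estimate. The AUC and standard-error inputs are illustrative values, not the six included studies' data, and for self-containment it uses the simpler DerSimonian–Laird estimator of between-study variance rather than the REML estimator used in the analysis.

```python
import math

def logit(p):
    return math.log(p / (1 - p))

def inv_logit(x):
    return 1 / (1 + math.exp(-x))

# Hypothetical per-study AUCs and their standard errors (illustration only)
aucs = [0.80, 0.86, 0.90, 0.83]
ses = [0.03, 0.02, 0.04, 0.025]

# Delta method: SE on the logit scale is se / (p * (1 - p))
y = [logit(a) for a in aucs]
v = [(se / (a * (1 - a))) ** 2 for a, se in zip(aucs, ses)]

# DerSimonian-Laird estimate of between-study variance tau^2
w = [1 / vi for vi in v]
ybar = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
q = sum(wi * (yi - ybar) ** 2 for wi, yi in zip(w, y))
c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
tau2 = max(0.0, (q - (len(y) - 1)) / c)

# Random-effects pooled estimate, back-transformed to the AUC scale
w_re = [1 / (vi + tau2) for vi in v]
pooled_logit = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)
se_pooled = math.sqrt(1 / sum(w_re))
lo = pooled_logit - 1.96 * se_pooled
hi = pooled_logit + 1.96 * se_pooled

# I^2: proportion of total variability due to between-study heterogeneity
i2 = max(0.0, (q - (len(y) - 1)) / q) * 100 if q > 0 else 0.0

print(f"Pooled AUC: {inv_logit(pooled_logit):.3f} "
      f"(95% CI {inv_logit(lo):.3f}-{inv_logit(hi):.3f}), I2 = {i2:.1f}%")
```

Pooling on the logit scale keeps the confidence interval inside the (0, 1) bound of the AUC, which a normal interval computed on the raw scale cannot guarantee for estimates near 1.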