Domain shifts decrease the accuracy of machine learning algorithms to detect myocardial ischemia in ECG recordings: A multi-database analysis

Sandra Frank
Martin W. Dünser
Thomas Tschoellitsch
Akos Filakovszky
Victoria Habsburg-Lothringen
Günter Klambauer
Jens Meier

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Machine learning algorithms have shown excellent accuracy in detecting myocardial ischemia from electrocardiograms, but their clinical reliability remains uncertain. In this study, we explored how domain shifts between different datasets affect the performance of these algorithms. We used six publicly available 12-lead ECG databases, containing a total of 55,953 recordings, and applied several machine learning approaches, including generalized linear models, random forests, gradient boosting, deep neural networks, and ensemble learning. The models were evaluated under different conditions—within individual datasets, across all datasets combined, and using leave-one-dataset-out validation.When trained and tested on the same dataset, the models performed very well, with area under the curve values exceeding 0.90. However, performance dropped notably when the models were tested on data from different sources. Sensitivity decreased substantially, while specificity remained relatively stable. Further analysis showed that variations in ECG patterns across datasets contributed to these differences in performance.Overall, our results demonstrate that while machine learning algorithms can detect myocardial ischemia accurately within familiar data, their ability to generalize across diverse datasets is limited. Addressing dataset heterogeneity and improving model robustness will be essential before such systems can be reliably implemented in clinical practice.

Version published to 10.21203/rs.3.rs-7893141/v1 on Research Square
Nov 13, 2025

Reliability of Artificial Intelligence-enhanced Electrocardiography

This article has 7 authors:
1. Lovedeep S Dhingra
2. Philip M Croon
3. Bruno Batinica
4. Arya Aminorroaya
5. Aline F Pedroso
6. Evangelos K Oikonomou
7. Rohan Khera
This article has no evaluationsLatest version Nov 6, 2025
Cardiac Classification with Multi-Scale Convolutional Neural Network From Paper ECG

This article has 3 authors:
1. Xue Cheng
2. Jiang Yi
3. Gao Peng
This article has no evaluationsLatest version Oct 7, 2025
A Fast, Lightweight, and Generalizable Deep Neural Network for the Detection of Atrial Fibrillation

This article has 6 authors:
1. Harshit Mishra
2. Farhan Adam Mukadam
3. Nachiket Makwana
4. Pradyot Tiwari
5. K. Subramani
6. K. V. S. Hari
This article has no evaluationsLatest version Nov 15, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Reliability of Artificial Intelligence-enhanced Electrocardiography

Cardiac Classification with Multi-Scale Convolutional Neural Network From Paper ECG

A Fast, Lightweight, and Generalizable Deep Neural Network for the Detection of Atrial Fibrillation