Towards Automated Neonatal EEG Analysis: Multi-Center Validation of a Reliable Deep Learning Pipeline

Tim Hermans
Anneleen Dereymaeker
Katrien Lemmens
Katrien Jansen
Fatima Usman
Shellie Robinson
Gunnar Naulaers
Maarten De Vos
Caroline Hartley

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective

To evaluate the reliability and generalization of NeoNaid, a fully automated software tool for neonatal EEG analysis, based on functional brain age (FBA) estimation and sleep staging.

Methods

NeoNaid combines a multi-task deep learning model with proposed quality control routines detecting artefacts, out-of-distribution inputs, and uncertain predictions. Based on a raw EEG input, it outputs one global FBA estimate and a continuous 2-state hypnogram. We validated performance on an two independent hospital settings: an internal dataset (33 EEGs, 17 infants, median 900 minutes/recording) and an external dataset (38 EEGs, 24 infants, median 124 minutes/recording).

Results

Quality control rejected comparable number of segments in the internal and external datasets, reducing extreme errors in FBA estimation, and modestly improving sleep staging accuracy. Across the internal and external data, NeoNaid achieved median absolute FBA errors of 0.50 and 0.55 weeks and Cohen’s Kappa values of 0.89 and 0.87 for quiet sleep detection, respectively.

Conclusions

NeoNaid demonstrated improved reliability through integrated quality control and robust generalization across recording setups.

Significance

By focusing on validation and trustworthiness, this work takes an essential step toward clinical adoption of automated neonatal EEG analysis and supports its utility for both NICU practice and large-scale research.

Version published to 10.1101/2025.10.16.25338113 on medRxiv
Oct 17, 2025