Generalizable and Resilient Multimodal Temporal Learning

Abstract

Comprehending human sleep mechanisms is vital for diagnosing a range of neurological and physiological conditions. Traditional sleep staging relies on expert annotation of polysomnographic recordings, a process that is labor-intensive and susceptible to inconsistency. Although automated sleep staging has gained traction, most current systems depend predominantly on EEG signals, which limits their robustness in clinical scenarios where signal quality is often compromised. In this work, we propose MedFuseSleep, a multimodal temporal learning architecture built to classify sleep stages under imperfect data conditions. The model is specifically designed to maintain high performance even in the presence of missing or noisy inputs by adaptively incorporating EEG, EOG, and auxiliary physiological modalities. Drawing inspiration from mid-to-late fusion strategies and grounded in a multi-objective learning framework, MedFuseSleep facilitates cross-modal representation learning while preserving tolerance to corrupted or absent signals. This design enables effective sleep stage inference even when key modalities such as EEG are degraded or unavailable. We validate MedFuseSleep on the SHHS-1 dataset, a large-scale benchmark, and report consistent gains over both unimodal baselines and existing multimodal techniques. Notably, we find that multimodal training not only improves performance on full data but also leads to better unimodal generalization compared to training with unimodal inputs alone. Our findings emphasize the utility of resilient multimodal modeling and advocate for broader integration of robust fusion techniques in clinical time series applications.
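
To make the fusion idea in the abstract concrete, the sketch below illustrates one plausible reading of a mid-to-late fusion model with tolerance to missing modalities: per-modality temporal encoders, a masked fusion step, and per-modality heads that could serve a multi-objective loss. This is not the authors' implementation; all class, layer, and parameter names (e.g. MedFuseSleepSketch, hidden, present_mask) are assumptions for illustration only.

```python
import torch
import torch.nn as nn


class MedFuseSleepSketch(nn.Module):
    """Minimal sketch of mid-to-late multimodal fusion for sleep staging.

    Hypothetical structure inferred from the abstract, not the published model:
    one temporal encoder per modality (EEG, EOG, auxiliary signal), a fused
    classifier over masked modality features, and unimodal heads that can
    contribute auxiliary objectives.
    """

    def __init__(self, in_channels=(1, 1, 1), hidden=64, n_stages=5):
        super().__init__()
        # One lightweight temporal encoder per modality.
        self.encoders = nn.ModuleList(
            nn.Sequential(
                nn.Conv1d(c, hidden, kernel_size=7, padding=3),
                nn.ReLU(),
                nn.AdaptiveAvgPool1d(1),  # collapse the time axis
                nn.Flatten(),
            )
            for c in in_channels
        )
        # Per-modality heads allow a multi-objective (fused + unimodal) loss.
        self.unimodal_heads = nn.ModuleList(
            nn.Linear(hidden, n_stages) for _ in in_channels
        )
        # Fusion head operates on the masked average of modality features.
        self.fusion_head = nn.Linear(hidden, n_stages)

    def forward(self, signals, present_mask):
        # signals: list of (batch, channels, time) tensors, one per modality.
        # present_mask: (batch, n_modalities) floats, 1 = modality usable.
        feats = torch.stack(
            [enc(x) for enc, x in zip(self.encoders, signals)], dim=1
        )  # (batch, n_modalities, hidden)
        mask = present_mask.unsqueeze(-1)
        # Average only over modalities that are actually present.
        fused = (feats * mask).sum(1) / mask.sum(1).clamp(min=1e-6)
        unimodal_logits = [h(feats[:, i]) for i, h in enumerate(self.unimodal_heads)]
        return self.fusion_head(fused), unimodal_logits
```

Under this reading, robustness to degraded or absent EEG would come from randomly zeroing entries of present_mask during training (a form of modality dropout) and from combining a cross-entropy loss on the fused logits with weighted cross-entropy terms on each unimodal head, so that each encoder remains useful on its own.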
