Markovian Representation Learning for Longitudinal Electronic Health Records


Abstract

Deep learning models for electronic health records are typically trained with token-level reconstruction or task-specific supervision, encouraging contextual predictability rather than explicit modeling of temporal dynamics. We propose a Markovian representation learning framework that treats longitudinal EHR data as realizations of a latent stochastic process and trains embeddings to admit low-complexity transition operators in latent space. By optimizing transition likelihood alongside information-preserving objectives, the learned representations approximate sufficient statistics of the underlying clinical state, yielding compact dynamical modes with interpretable time scales. Across multiple prediction tasks and external-hospital transfer settings, Markov-structured pretraining improves discrimination, temporal forecasting performance, and robustness relative to supervised and masked-event baselines. We further demonstrate that the learned transition structure supports LLM-assisted summarization and agentic cohort exploration, enabling clinician-friendly interpretation of latent progression patterns. These results suggest that explicitly modeling temporal transition structure provides a principled complement to token-centric foundation modeling for longitudinal medical data.
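The core training idea in the abstract, an encoder whose latent codes admit a low-complexity (here, linear) transition operator, fit jointly with an information-preserving reconstruction term, can be sketched on synthetic data. Everything below (the dimensions, the linear encoder/decoder, the plain gradient-descent loop, and the toy data generator) is an illustrative assumption, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Toy stand-in for longitudinal EHR data (illustrative assumption) ------
# One long sequence of d_obs-dimensional "visit" feature vectors driven by a
# stable latent linear process.
T, d_obs, d_lat = 400, 8, 3
A_true = 0.9 * np.linalg.qr(rng.normal(size=(d_lat, d_lat)))[0]  # stable dynamics
C_true = rng.normal(size=(d_obs, d_lat))
z = np.zeros((T, d_lat))
for t in range(1, T):
    z[t] = A_true @ z[t - 1] + 0.1 * rng.normal(size=d_lat)
x = z @ C_true.T + 0.05 * rng.normal(size=(T, d_obs))

# --- Learnable pieces: encoder E, decoder D, latent transition A -----------
E = 0.1 * rng.normal(size=(d_lat, d_obs))
D = 0.1 * rng.normal(size=(d_obs, d_lat))
A = np.eye(d_lat)

def total_loss(E, D, A):
    h = x @ E.T                                       # latent codes h_t = E x_t
    l_trans = np.mean((h[:-1] @ A.T - h[1:]) ** 2)    # one-step transition fit
    l_rec = np.mean((h @ D.T - x) ** 2)               # information preservation
    return l_trans + l_rec

loss_init = total_loss(E, D, A)

lr = 0.01
for _ in range(3000):
    h = x @ E.T
    r_t = h[:-1] @ A.T - h[1:]        # transition residuals, shape (T-1, d_lat)
    r_r = h @ D.T - x                 # reconstruction residuals, shape (T, d_obs)
    # Hand-derived gradients of the two mean-squared terms.
    gA = 2 * r_t.T @ h[:-1] / (T - 1)
    gE = 2 * (A.T @ (r_t.T @ x[:-1]) - r_t.T @ x[1:]) / (T - 1) \
         + 2 * D.T @ (r_r.T @ x) / T
    gD = 2 * r_r.T @ h / T
    A -= lr * gA
    E -= lr * gE
    D -= lr * gD

loss_final = total_loss(E, D, A)

# Eigenvalues of the learned A are the "dynamical modes" mentioned in the
# abstract; a mode with |lambda| < 1 decays, with a characteristic time scale
# of roughly -1 / log|lambda| steps.
modes = np.linalg.eigvals(A)
```

In this reading, "compact dynamical modes with interpretable time scales" corresponds to the spectrum of the learned transition operator: each eigenvalue's modulus sets how quickly its mode decays, giving a clinically inspectable notion of progression speed.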