Federated Learning-Driven Health Risk Prediction on Electronic Health Records Under Privacy Constraints

Ran Hao
Wei-Chen Chang
Jiacheng Hu
Min Gao

Abstract

This study addresses privacy protection and data silos in the intelligent analysis of electronic health records and in health risk prediction by proposing a federated learning-based framework. Under this framework, medical institutions do not pool their data in centralized storage. Instead, each institution trains a model locally and the resulting parameters are securely aggregated, so the model is optimized collaboratively across institutions and its performance improves while compliance and privacy requirements are met. A multimodal feature fusion mechanism jointly models structured diagnostic information, clinical text, and time-series data, capturing complex semantic associations and temporal relationships across modalities. At the output stage, the model produces probabilistic predictions and is optimized with a cross-entropy loss, which enhances the accuracy and stability of risk identification. Experiments on a public electronic health record dataset show that the proposed method outperforms several baseline models in accuracy, precision, recall, and F1-score, achieving a good balance between privacy protection and predictive robustness. Overall, this study establishes an integrated framework that combines federated learning with multimodal modeling, offering a feasible path toward efficient use of electronic health records for health risk prediction, increasing the value of medical data, and supporting better health management decisions.
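The training loop described in the abstract (local training at each institution, followed by aggregation of parameters rather than data) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the preprint: the logistic-regression model, the synthetic one-feature data, the function names `local_train`, `fed_avg`, and `cross_entropy`, and the hyperparameters. The aggregation shown is plain sample-weighted averaging (FedAvg style), not a secure aggregation protocol, and the multimodal fusion step is omitted.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(weights, x):
    # Probabilistic output: estimated risk in (0, 1).
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)))

def local_train(weights, data, lr=0.1, epochs=5):
    """One client's update: SGD on binary cross-entropy, starting from the global weights."""
    w = list(weights)
    for _ in range(epochs):
        for x, y in data:
            p = predict(w, x)
            # For a logistic model, the cross-entropy gradient is (p - y) * x.
            for i, xi in enumerate(x):
                w[i] -= lr * (p - y) * xi
    return w

def fed_avg(client_weights, client_sizes):
    """Server step: average client parameters, weighted by local sample count."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

def cross_entropy(weights, data):
    eps = 1e-12  # guards against log(0)
    loss = 0.0
    for x, y in data:
        p = predict(weights, x)
        loss -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return loss / len(data)

def make_client(n, seed):
    # Hypothetical synthetic data: a bias term plus one feature, label 1 when the feature is positive.
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x1 = rng.uniform(-1, 1)
        rows.append(([1.0, x1], 1 if x1 > 0 else 0))
    return rows

# Three "institutions" with different amounts of local data; raw records never leave a client.
clients = [make_client(40, 0), make_client(60, 1), make_client(100, 2)]
all_data = [row for c in clients for row in c]  # used only to evaluate this sketch

global_w = [0.0, 0.0]
initial_loss = cross_entropy(global_w, all_data)
for _ in range(10):  # communication rounds
    updates = [local_train(global_w, c) for c in clients]
    global_w = fed_avg(updates, [len(c) for c in clients])
final_loss = cross_entropy(global_w, all_data)
```

Only the parameter lists returned by `local_train` cross the client boundary in this sketch. A real deployment would additionally need to protect those updates, for example with a secure aggregation protocol, which is outside the scope of this example.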

Version published to 10.20944/preprints202510.1471.v1 on Oct 20, 2025

Related articles

Federated learning enabled privacy-preserving data access for predicting 30-day mortality in acute myocardial infarction

This article has 4 authors:
1. Koutarou Matsumoto
2. Yuta Nakamura
3. Masahiro Kamouchi
4. Ewout Steyerberg
This article has no evaluations. Latest version Sep 22, 2025

SplitFed Learning for Dementia Diagnosis: A Privacy-Preserving Framework for Distributed Healthcare

This article has 4 authors:
1. Neha Kalbande
2. Amar Nath Patra
3. Raj Mani Shukla
4. Oliver Faust
This article has no evaluations. Latest version Oct 8, 2025

Hierarchical Personalized Continual Federated Learning for Real Time Risk Prediction of Chronic Diseases

This article has 4 authors:
1. Abhigyan Ghoshal
2. Mohammad Armaan Ali
3. M. Sambath
4. E. Balraj
This article has no evaluations. Latest version Oct 8, 2025
