Distribution-Aware Federated Learning for Diabetes Prediction Using Tabular Clinical Data Under Non-IID and Class-Imbalanced Settings
Abstract
Federated learning (FL) enables collaborative clinical model training without centralized data sharing, yet its deployment is hindered by statistical heterogeneity (non-IID data) and inherent class imbalance across healthcare institutions. Conventional aggregation strategies such as FedAvg and FedProx weight client updates solely by dataset size, ignoring class distributions and thereby biasing the global model toward the majority class. To address this, we propose Distribution-Aware Federated Learning (DA-FL), which introduces a minority-class amplification factor \(\phi_k\) computed as the ratio of a client's local positive-class rate to the global positive-class rate. Combined with class-weighted cross-entropy loss at the client level, DA-FL forms a two-level correction mechanism that mitigates imbalance without additional data sharing. Experiments on the CDC BRFSS 2021 diabetes dataset (236,378 records across five simulated clients under three non-IID levels) show that DA-FL improves F1-Macro by 18.2% and G-Mean by 26.7% over FedAvg under moderate non-IID conditions, while achieving 31-fold greater F1-Macro stability across 30 communication rounds. These findings demonstrate that DA-FL is an effective and practically deployable solution for federated clinical prediction under realistic non-IID and class-imbalanced settings.
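The amplification factor described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function names and the multiplicative combination of \(\phi_k\) with the FedAvg size weights are assumptions made here for concreteness; the paper itself defines only \(\phi_k\) as the ratio of local to global positive-class rates.

```python
import numpy as np

def amplification_factors(local_pos_rates, global_pos_rate):
    # phi_k = (local positive-class rate of client k) / (global positive-class rate),
    # as defined in the abstract.
    return np.asarray(local_pos_rates, dtype=float) / global_pos_rate

def da_fl_weights(client_sizes, local_pos_rates):
    # Hypothetical aggregation sketch: scale the FedAvg size weights by phi_k
    # and renormalize, so clients holding a larger share of the minority class
    # contribute more to the global update. The exact combination rule in DA-FL
    # is not specified in the abstract; multiplicative scaling is an assumption.
    sizes = np.asarray(client_sizes, dtype=float)
    rates = np.asarray(local_pos_rates, dtype=float)
    # Global positive-class rate = pooled positives / pooled samples.
    global_pos_rate = np.sum(sizes * rates) / sizes.sum()
    phi = amplification_factors(rates, global_pos_rate)
    w = sizes * phi
    return w / w.sum()

# Two equally sized clients; client 1 holds three times the positive-class rate
# of client 0, so it is upweighted relative to plain FedAvg (0.5 / 0.5).
weights = da_fl_weights([100, 100], [0.1, 0.3])
```

With these toy numbers the global positive rate is 0.2, giving \(\phi = (0.5, 1.5)\) and normalized weights (0.25, 0.75): the minority-rich client dominates the aggregation, which is the bias-correction the abstract describes.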