Wavelet-Domain Privacy SGD (WDP-SGD): Frequency-Selective Privacy-Preserving Medical AI


Abstract

Protecting sensitive medical data during training is critical because transformer gradients can leak patient-specific information. We introduce a privacy-preserving clinical AI framework that integrates three complementary elements: (i) Bayesian synthetic data generation to produce epidemiologically realistic yet non-identifiable electronic health records, (ii) Wavelet-Domain Privacy Stochastic Gradient Descent (WDP-SGD) to apply frequency-selective noise to gradient updates of BERT-based classifiers, and (iii) multi-modal privacy auditing to empirically monitor potential information leakage. Unlike conventional differential privacy, which injects uniform noise, WDP-SGD perturbs high-frequency gradient components that disproportionately encode patient-specific information while preserving low-frequency components containing generalisable medical knowledge. Applied to a large synthetic medical text corpus covering multiple conditions, our approach consistently delivers stronger privacy protection and improved model performance relative to standard DP-SGD while maintaining convergence behaviour close to a non-private baseline. Privacy attack simulations, including membership inference, attribute inference and gradient reconstruction, further demonstrate enhanced resilience to adversarial attempts to extract sensitive information. These results indicate that wavelet-based differential privacy offers a practical pathway to privacy-conscious clinical language models, achieving a more favourable balance between privacy and utility than existing uniform-noise methods.
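The abstract does not specify the wavelet family, decomposition depth, or noise calibration used in WDP-SGD, so the following is only an illustrative sketch of the frequency-selective idea: decompose a gradient vector with a single-level Haar transform, add Gaussian noise to the high-frequency (detail) coefficients only, and reconstruct. The function names (`haar_dwt`, `haar_idwt`, `wdp_noise`) and the single-level Haar choice are assumptions for illustration; a real implementation would also need per-sample gradient clipping and privacy-accountant-calibrated noise to obtain formal differential-privacy guarantees.

```python
import numpy as np

def haar_dwt(x):
    """Single-level Haar transform of an even-length vector.

    Returns (approximation, detail): low- and high-frequency halves.
    """
    a = (x[0::2] + x[1::2]) / np.sqrt(2)  # low-frequency (approximation)
    d = (x[0::2] - x[1::2]) / np.sqrt(2)  # high-frequency (detail)
    return a, d

def haar_idwt(a, d):
    """Inverse of haar_dwt: interleave reconstructed even/odd samples."""
    x = np.empty(a.size * 2)
    x[0::2] = (a + d) / np.sqrt(2)
    x[1::2] = (a - d) / np.sqrt(2)
    return x

def wdp_noise(grad, sigma, rng):
    """Frequency-selective perturbation (hypothetical sketch):
    Gaussian noise on detail coefficients only, approximation kept intact.
    """
    a, d = haar_dwt(grad)
    d_noisy = d + rng.normal(0.0, sigma, size=d.shape)
    return haar_idwt(a, d_noisy)

# Usage: the approximation coefficients survive the perturbation exactly,
# which is the sense in which low-frequency "generalisable" structure is kept.
rng = np.random.default_rng(0)
g = np.arange(8, dtype=float)           # toy stand-in for a gradient slice
g_priv = wdp_noise(g, sigma=0.5, rng=rng)
a_before, _ = haar_dwt(g)
a_after, _ = haar_dwt(g_priv)
assert np.allclose(a_before, a_after)   # low-frequency content unchanged
```

In contrast, standard DP-SGD would add isotropic noise to every coordinate of `g`, disturbing the approximation coefficients as well; the wavelet-domain variant concentrates the same perturbation budget on the components claimed to carry patient-specific detail.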
