A Data Privacy Protection Method for Infectious Disease Prediction Models with Balanced Training Speed and Accuracy

Xinhang Wang
Guangming Pan
Zhen Luo
Ming Xiao
Li Yang
Xiaoqiu Shi
Ying Huo
Mianyang Li
Le Zhang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Recently, the application of deep learning technologies in the construction of infectious disease prediction models has significantly increased their auxiliary effectiveness in formulating prevention and control strategies for infectious diseases. Typically, scientists rely on extensive datasets to thoroughly train models, aiming to generate ones with high predictive accuracy to forecast the occurrence trends of emerging infectious diseases. However, given the inherent need for privacy protection in medical data, many institutions are reluctant to readily share their data resources due to the compliance and security concerns, which not only directly limits the comprehensiveness and diversity of training data, but also decreases the predictive accuracy on infectious disease predictive models.To address these issues, firstly, we propose a Random Transmission Hybrid Homomorphic Algorithm, which enhances the efficiency of the model by utilizing a random transmission sequence and a hybrid approach that combines semi-homomorphic and fully homomorphic algorithms. Secondly, we developed a DS-DSSGD (Data Select-Distributed selective stochastic Gradient descent) algorithm to balance the training speed and predictive accuracy for the model after incorporating privacy-preserving computational technologies. Finally, we have established a scientific research collaboration platform, XDP, to integrate data from multiple users and provide end-to-end lifecycle management for data.

Version published to 10.21203/rs.3.rs-7728227/v1 on Research Square
Dec 3, 2025

FedVQC for Genomic Data: A Quantum-Enhanced Privacy Approach

This article has 3 authors:
1. Vaidehi Gawande
2. Jayesh V. Hire
3. Sagar Dhande
This article has no evaluationsLatest version Jan 30, 2026
Personalized Disease Risk Prediction from Multimodal Health Data Using Large Language Models

This article has 2 authors:
1. Hanieh Arjmand
2. Alexandre Tomberg
This article has no evaluationsLatest version Jan 25, 2026
A Study on Explainable Artificial Intelligence(XAI) in Malware Detection for Proactive Cyber Threat Hunting

This article has 3 authors:
1. Pankaj Gajakosh S.
2. Rama Abirami K.
3. Nagendra Kumar Y. J.
This article has no evaluationsLatest version Dec 23, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

FedVQC for Genomic Data: A Quantum-Enhanced Privacy Approach

Personalized Disease Risk Prediction from Multimodal Health Data Using Large Language Models

A Study on Explainable Artificial Intelligence(XAI) in Malware Detection for Proactive Cyber Threat Hunting