Robust Federated Learning Against Data Poisoning Attacks: Prevention and Detection of Attacked Nodes

Pretom Roy Ovi
Aryya Gangopadhyay

Abstract

Federated learning (FL) enables a large number of participants to build a model collaboratively without sharing sensitive data with the central server. Because of its distributed nature, FL has limited control over local data and the corresponding training process. It is therefore susceptible to data poisoning attacks, in which malicious workers train the model on malicious data. Attackers on the worker side can easily manipulate local data by swapping the labels of training instances, adding noise to training instances, or adding out-of-distribution training instances to the local data. Local workers under such attacks carry incorrect information to the server, poison the global model, and cause misclassifications. Preventing and detecting such data poisoning attacks is therefore crucial to building a robust federated training framework. To address this, we propose a prevention strategy in federated learning, namely confident federated learning, to protect workers from such data poisoning attacks. The strategy first validates the label quality of local training samples by characterizing and identifying label errors in the local training data, and then excludes the detected mislabeled samples from local training. We evaluate the approach in both the image and audio domains, and the experimental results validate the robustness of confident federated learning in preventing data poisoning attacks. The method detects mislabeled training samples with above 85% accuracy and excludes them from the training set to prevent data poisoning attacks on the local workers. However, the prevention strategy is effective locally only up to a certain percentage of poisonous samples; beyond that percentage, it may not be effective in preventing attacks.
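The prevention step described above (flag likely label errors in a worker's local data, then train only on the remaining samples) can be sketched roughly as follows. This is a simplified illustration in the spirit of confident learning, not the authors' implementation: the function name `find_suspect_labels` is hypothetical, and the sketch assumes that out-of-sample predicted class probabilities for the local samples are already available.

```python
import numpy as np

def find_suspect_labels(labels, pred_probs):
    """Return a boolean mask marking samples whose given label looks wrong.

    Assumption: pred_probs[i, k] is an out-of-sample probability that
    sample i belongs to class k (e.g. from cross-validation).
    """
    labels = np.asarray(labels)
    pred_probs = np.asarray(pred_probs, dtype=float)
    n_classes = pred_probs.shape[1]
    # Per-class threshold: mean confidence in class k over samples labeled k.
    # A class with no samples gets an infinite threshold (never a candidate).
    thresholds = np.array([
        pred_probs[labels == k, k].mean() if np.any(labels == k) else np.inf
        for k in range(n_classes)
    ])
    # A class is a confident candidate when its probability reaches its threshold.
    confident = pred_probs >= thresholds
    # Among the confident candidates, take the most probable class.
    masked = np.where(confident, pred_probs, -1.0)
    best = masked.argmax(axis=1)
    has_candidate = confident.any(axis=1)
    # Suspect: some class is confidently predicted and it is not the given label.
    return has_candidate & (best != labels)

# Toy local dataset: sample 2 is labeled 0 although the model favors class 1.
labels = np.array([0, 0, 0, 1, 1, 1])
pred_probs = np.array([
    [0.90, 0.10],
    [0.80, 0.20],
    [0.10, 0.90],
    [0.20, 0.80],
    [0.10, 0.90],
    [0.15, 0.85],
])
suspect = find_suspect_labels(labels, pred_probs)
clean_indices = np.flatnonzero(~suspect)  # train locally on these samples only
```

In this toy example only the third sample is flagged, so local training would proceed on the other five. Samples for which no class reaches its threshold are kept, which is a conservative design choice made for this sketch.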
When the share of poisonous samples exceeds what the prevention strategy can handle, detection of the attacked workers is needed. So, in addition to the prevention strategy, we propose a novel detection strategy in the federated learning framework to detect workers under attack. We create a class-wise cluster representation for every participating worker from the neuron activation maps of its local model, and analyze the resulting clusters to filter out the workers under attack before model aggregation. We experimentally demonstrate the efficacy of the detection strategy in identifying workers affected by data poisoning attacks, along with the attack type, e.g., label flipping or dirty labeling. In addition, our experimental results suggest that the global model could not converge even after a large number of training rounds in the presence of malicious workers, whereas after detecting the malicious workers with our detection method and discarding them from model aggregation, the global model converged within very few training rounds. Furthermore, the approach stays robust under different data distributions and model sizes and does not require prior knowledge of the number of attackers in the system.
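The abstract does not specify how the class-wise clusters are analyzed, so the following is only a minimal sketch of one plausible reading. It assumes each worker supplies, per class label, a matrix of activations (hypothetically from a late layer of its local model), splits each class into two clusters with a tiny 2-means routine, and flags a worker when any class splits into two well-separated groups. The separation score, the `threshold` value, and all function names are assumptions made for illustration.

```python
import numpy as np

def two_means(x, n_iter=20):
    """Tiny deterministic 2-means; returns a cluster index (0 or 1) per row."""
    # Seed with the point farthest from the mean, then the point farthest from it.
    c0 = x[np.linalg.norm(x - x.mean(axis=0), axis=1).argmax()]
    c1 = x[np.linalg.norm(x - c0, axis=1).argmax()]
    centers = np.stack([c0, c1])
    for _ in range(n_iter):
        dists = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        assign = dists.argmin(axis=1)
        for k in (0, 1):
            if np.any(assign == k):
                centers[k] = x[assign == k].mean(axis=0)
    return assign

def class_separation(x):
    """Centroid gap divided by the pooled within-cluster RMS distance."""
    assign = two_means(x)
    if assign.min() == assign.max():  # everything landed in one cluster
        return 0.0
    centers = np.stack([x[assign == k].mean(axis=0) for k in (0, 1)])
    gap = np.linalg.norm(centers[0] - centers[1])
    within = np.sqrt(np.mean(np.sum((x - centers[assign]) ** 2, axis=1)))
    return gap / (within + 1e-12)

def flag_workers(activations, threshold=3.0):
    """activations[worker][class_id] -> (n_samples, n_features) array.

    The threshold is an arbitrary illustrative value, not taken from the paper.
    """
    flagged = set()
    for worker, per_class in activations.items():
        scores = [class_separation(np.asarray(x, dtype=float))
                  for x in per_class.values()]
        if max(scores) > threshold:
            flagged.add(worker)
    return flagged

# Synthetic activations: worker "w1" has 20% foreign samples under class 0.
rng = np.random.default_rng(0)
clean = {c: rng.normal(loc=3.0 * c, size=(200, 8)) for c in range(3)}
poisoned = {c: rng.normal(loc=3.0 * c, size=(200, 8)) for c in range(3)}
poisoned[0][:40] += 6.0
activations = {"w0": clean, "w1": poisoned}

flagged = flag_workers(activations)
kept = [w for w in activations if w not in flagged]  # aggregate only these
```

Only the updates from the workers in `kept` would then be passed to model aggregation. A real system would need a principled way to choose the threshold and the layer whose activations are clustered; both are left open here.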

Version published to 10.3390/electronics14152970
Jul 25, 2025
Version published to 10.20944/preprints202506.2218.v2
Jul 17, 2025
Version published to 10.20944/preprints202506.2218.v1
Jun 26, 2025

Resisting Against Targeted Poisoning Attacks in Lightweight Privacy-Preserving Federated Learning

This article has 3 authors:
1. Hongliang Zhang
2. Haojie Xie
3. Jiandong Lv
This article has no evaluations
Latest version Jul 18, 2025

The Adaptive Ensemble Learning-Based Intrusion Detection System for Enhanced Cybersecurity in Networked Environments

This article has 2 authors:
1. Kuldeep Kumar
2. Namrta Tanwar
This article has no evaluations
Latest version Aug 19, 2025

Distributed Detection of DDoS Attack on 5G Network Slices

This article has 3 authors:
1. Md Mahibul Hasan
2. Aymen Ben Said
3. Nashid Shahriar
This article has no evaluations
Latest version Jul 28, 2025
