Optimizing K-Means Clustering with Privacy Budget Allocation Based on Variance and Sensitivity

Afzal Ali
Sreemoyee Biswas
Nilay Khare
Mansi Gyanchandani

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This paper presents a novel approach for enhancing k-means clustering through a privacy-preserving budget allocation mechanism based on variance and sensitivity analysis. The proposed method aims to balance the trade-off between data utility and privacy preservation by selectively allocating privacy budgets across features, emphasizing features with higher variance and lower sensitivity to maintain clustering accuracy. We employ differential privacy techniques, particularly the Laplace mechanism, to introduce controlled noise, protecting user data while minimizing information loss. Comparative analysis with traditional uniform privacy allocation reveals that our approach better preserves cluster cohesion and separation, resulting in superior performance in clustering tasks. Experiments conducted on healthcare datasets demonstrate the efficacy of the proposed strategy in achieving robust privacy guarantees with minimal impact on clustering utility, making it suitable for sensitive data analysis scenarios.

Version published to 10.21203/rs.3.rs-6086086/v1 on Research Square
May 14, 2025

A Privacy-Preserving Rule Fusion Approach for Uncertainty-Aware Decision-Making in Posture Detection

This article has 3 authors:
1. Barbara Pękala
2. Anna Wilbik
3. Dorota Gil
This article has no evaluationsLatest version Feb 1, 2026
Fair Client Selection Method for Federated Learning Based on Discretized Firefly Algorithm

This article has 4 authors:
1. XiaoYe Li
2. Yangyang Zhang
3. Zhenlong Sun
4. Wei Zhao
This article has no evaluationsLatest version Dec 29, 2025
Entropy-Based Adaptive Ratio Estimators in Stratified Sampling Using Information Theory Measures with Empirical and Simulation Evidence

This article has 2 authors:
1. Anchal Yadav
2. Mukesh Kumar
This article has no evaluationsLatest version Jan 28, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Privacy-Preserving Rule Fusion Approach for Uncertainty-Aware Decision-Making in Posture Detection

Fair Client Selection Method for Federated Learning Based on Discretized Firefly Algorithm

Entropy-Based Adaptive Ratio Estimators in Stratified Sampling Using Information Theory Measures with Empirical and Simulation Evidence