Optimizing K-Means Clustering with Privacy Budget Allocation Based on Variance and Sensitivity

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

This paper presents a novel approach for enhancing k-means clustering through a privacy-preserving budget allocation mechanism based on variance and sensitivity analysis. The proposed method aims to balance the trade-off between data utility and privacy preservation by selectively allocating privacy budgets across features, emphasizing features with higher variance and lower sensitivity to maintain clustering accuracy. We employ differential privacy techniques, particularly the Laplace mechanism, to introduce controlled noise, protecting user data while minimizing information loss. Comparative analysis with traditional uniform privacy allocation reveals that our approach better preserves cluster cohesion and separation, resulting in superior performance in clustering tasks. Experiments conducted on healthcare datasets demonstrate the efficacy of the proposed strategy in achieving robust privacy guarantees with minimal impact on clustering utility, making it suitable for sensitive data analysis scenarios.

Article activity feed