Enhancing Data Privacy in Large Language Models through Private Association Editing

Davide Venditti
Elena Sofia Ruzzetti
Giancarlo A. Xompero
Cristina Giannone
Andrea Favalli
Raniero Romagnoli
Fabio Massimo Zanzotto

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Large language models (LLMs) require a significant redesign in solutions to preserve privacy in data-intensive applications due to their text-generation capabilities. Indeed, LLMs tend to memorize and emit private information when maliciously prompted. In this paper, we introduce Private Association Editing (PAE) as a novel defense approach for private data leakage. PAE is designed to effectively remove Personally Identifiable Information (PII) without retraining the model. Experimental results demonstrate the effectiveness of PAE with respect to alternative baseline methods. We believe PAE will serve as a critical tool in the ongoing effort to protect data privacy in LLMs, encouraging the development of safer models for real-world applications.

Version published to 10.20944/preprints202603.0339.v1
Mar 4, 2026

Beyond Disclosure: Reframing Privacy as Inference Impedance in Large Language Models

This article has 1 author:
1. Yair Oppenheim
This article has no evaluationsLatest version Mar 3, 2026
Privacy-Preserving Retrieval for Auditable Clinical Language Modeling on Real-World Radiology Data

This article has 3 authors:
1. Nuri Purswani
2. Viktor Schlegel
3. Anil Anthony Bharath
This article has no evaluationsLatest version Mar 8, 2026
Machine Unlearning in Large Language Models: A Survey of Challenges and Methods

This article has 5 authors:
1. Xiaming Tu
2. Tianqing Zhu
3. Zhenni Liu
4. Ping Xiong
5. Wanlei Zhou
This article has no evaluationsLatest version Mar 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Beyond Disclosure: Reframing Privacy as Inference Impedance in Large Language Models

Privacy-Preserving Retrieval for Auditable Clinical Language Modeling on Real-World Radiology Data

Machine Unlearning in Large Language Models: A Survey of Challenges and Methods