Research on Citizen Privacy Risk Assessment Method Based on Retrieval-Augmented Generation

Jingye Qu
Fujian Luo

Abstract

Severe infringement of citizens' personal privacy makes it crucial to assess how serious a personal information leak is. Current privacy risk assessment methods suffer from low efficiency and low accuracy. To address these challenges, this paper proposes a privacy risk assessment framework based on Retrieval-Augmented Generation (RAG). First, we analyze privacy risk factors in cases of personal information infringement, construct an evaluation indicator system, and build a fine-tuning dataset and a knowledge graph. Next, we propose a fine-tuning method that dynamically adjusts the LoRA rank: it monitors changes in the loss and gradients and automatically contracts the adapter matrix dimensions, reducing GPU memory consumption while maintaining generation quality. Finally, we apply RAG to combine the internal knowledge of the fine-tuned LLM with weighted external evidence through joint reasoning, yielding more reliable judgments and privacy risk assessments. Compared with traditional approaches, the method shows improved accuracy and text matching, which supports its effectiveness. Overall, the proposed method mitigates the hallucination issues of large language models while improving the accuracy and efficiency of privacy risk evaluation tasks.
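The rank-contraction idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual criterion, so everything here is an assumption made for illustration: the function names (`component_scores`, `contract_rank`, `maybe_contract`), the plateau test on the loss only (gradients are not monitored), the halving schedule, and the default thresholds are hypothetical and are not taken from the paper. The LoRA update is written as B @ A, with A of shape r x d_in and B of shape d_out x r, stored as plain Python lists.

```python
import math

def component_scores(A, B):
    """Frobenius norm of each rank-1 term B[:, i] A[i, :] of the update B @ A."""
    r = len(A)
    a_norms = [math.sqrt(sum(x * x for x in A[i])) for i in range(r)]
    b_norms = [math.sqrt(sum(row[i] ** 2 for row in B)) for i in range(r)]
    return [a * b for a, b in zip(a_norms, b_norms)]

def contract_rank(A, B, keep):
    """Keep the `keep` highest-scoring rank components, preserving their order."""
    scores = component_scores(A, B)
    top = sorted(range(len(A)), key=lambda i: scores[i], reverse=True)[:keep]
    idx = sorted(top)
    return [A[i] for i in idx], [[row[i] for i in idx] for row in B]

def maybe_contract(A, B, loss_history, window=3, tol=1e-2, min_rank=2):
    """Halve the rank when the loss has plateaued (assumed criterion).

    A plateau means the relative improvement over the last `window` steps
    is below `tol`. The rank never drops below `min_rank`.
    """
    rank = len(A)
    if len(loss_history) <= window or rank <= min_rank:
        return A, B
    old, new = loss_history[-window - 1], loss_history[-1]
    rel_improvement = (old - new) / max(abs(old), 1e-12)
    if rel_improvement < tol:
        return contract_rank(A, B, max(min_rank, rank // 2))
    return A, B

# Toy adapter with rank 4, d_in = 2, d_out = 3.
A = [[1.0, 0.0], [0.0, 0.1], [2.0, 0.0], [0.0, 0.01]]
B = [[1.0, 1.0, 1.0, 1.0] for _ in range(3)]
plateau = [1.0, 0.6, 0.5005, 0.5003, 0.5002, 0.5001]
A2, B2 = maybe_contract(A, B, plateau)
print(len(A2), len(B2[0]))  # 2 2
```

In this toy run the loss barely moves over the last three steps, so the rank is halved from 4 to 2 and the two components with the largest norm product are kept. A smaller adapter means fewer trainable parameters and less optimizer state, which is the kind of memory saving the abstract refers to.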

Version published to 10.21203/rs.3.rs-9015963/v1 on Research Square
Apr 10, 2026

Related articles

Beyond Disclosure: Reframing Privacy as Inference Impedance in Large Language Models

This article has 1 author:
1. Yair Oppenheim
This article has no evaluations. Latest version: Mar 3, 2026

Persistence Landscapes Across Privacy Budgets for Explanation Methods Across Differential Privacy Mechanisms

This article has 1 author:
1. Paul Zheng
This article has no evaluations. Latest version: Feb 16, 2026

Mitigating text data privacy risks from gradient and model inversion attacks with a dual-pronged defense

This article has 2 authors:
1. Yuxin Xie
2. Ying Gao
This article has no evaluations. Latest version: Apr 7, 2026
