Privacy Protection for Chinese Electronic Medical Records Using Large Language Models: Effectiveness Evaluation and Application of LLM Models in Medical Data Tasks

Gong Mengchun
Ouyang Zihao
Ma Dandan
Cai Endi
Liu Chao
Shi Wenzhao
Zhang Bohan
Ma Lian
Wei Yuna
Jiang Huizhen
Zhou Xiang

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

The privacy protection of medical patients has remained a critical concern in healthcare information management during the digital era. Conventional approaches have predominantly relied on rule-based protocols and data encryption systems, which typically require substantial involvement of IT professionals for implementation. Recent advancements in Large Language Models (LLMs) have introduced novel approaches for electronic medical records (EMRs) privacy protection, simultaneously enabling clinical practitioners to utilize these tools for specific data tasks.

Objectives

This study aims to leverage LLMs through a no-code framework to achieve structured processing of patient privacy data in Chinese EMRs and formulate privacy policies, while evaluating the practical efficacy of LLMs.

Methods

This study employs a disease-specific data subset from Peking Union Medical College Hospital (PUMCH), comprising data from approximately 160,000 patients, using a prompt engineering approach to enable LLMs to perform sensitive information annotation in lengthy EMR narratives. Simultaneously, it automates the classification of privacy-level for identified sensitive data and develops targeted protection strategies based on risk tiers, thereby mitigating non-essential exposure of patient privacy during data sharing. The research utilizes the Qwen model, with its entire workflow being exclusively driven by medical natural language prompts and self-evolving knowledge bases, requiring no supplementary programming or code development. These strategies were validated using the hospital’s test text dataset, with primary evaluation metrics focusing on precision rates (including accuracy of information extraction and privacy-level classification) and recall rate assessments for critical sensitive data categories.

Results

Utilizing 4 million text entries from PUMCH, we conducted sampled data observation and performed privacy annotation via LLM prompts across seven categories: names, addresses, contact details, national ID numbers, hospital names, sexually transmitted disease (STD) information, and pregnancy-related patient data. Through iterative prompt refinement via error analysis, optimal performance was achieved on the test set, demonstrating an average precision of 97% and recall of 95% across these seven entity types. Furthermore, sensitivity tier classification was implemented for three high-risk categories: addresses, STD information, and pregnancy-related data, attaining average precision of 95% and recall of 90% in sensitivity-level determination.

Discussion

We propose a novel codeless privacy protection framework leveraging LLMs, enabling intelligent anonymization of medical data through natural language interaction. This solution employs a three-tiered hierarchical protection mechanism that dynamically adapts privacy strategies to clinical scenario requirements, ensuring data security while maximizing data utility.

Version published to 10.1101/2025.07.27.25332177 on medRxiv
Jul 28, 2025

SPELL-LLMs: A Scalable and Privacy-Compliant NLP Pipeline Using Locally Hosted Large Language Models for Clinical Information Extraction

This article has 4 authors:
1. Ricardo Kleinlein
2. Kathryn J. Gray
3. David Bates
4. Vesela P. Kovacheva
This article has no evaluationsLatest version Jul 25, 2025
Simplification and Translation of Medical Reports Using Large Language Models-A Protocol for the Indian Context

This article has 2 authors:
1. Bhavin Jain
2. Sandeep Reddy
This article has no evaluationsLatest version Aug 13, 2025
Automated De-Identification, Consistent Obfuscation, and Regulatory Grade Validation of 2 Billion Patient Notes

This article has 9 authors:
1. Veysel Kocaman
2. Lindsay Mico
3. Mustafa Aytug Kaya
4. Nadaa Taiyab
5. David Talby
6. Tae Surh
7. Yuqing Guo
8. Vivek Tomer
9. Robert Kramer
This article has no evaluationsLatest version Sep 5, 2025

Listed in

Abstract

Background

Objectives

Methods

Results

Discussion

Article activity feed

Related articles

SPELL-LLMs: A Scalable and Privacy-Compliant NLP Pipeline Using Locally Hosted Large Language Models for Clinical Information Extraction

Simplification and Translation of Medical Reports Using Large Language Models-A Protocol for the Indian Context

Automated De-Identification, Consistent Obfuscation, and Regulatory Grade Validation of 2 Billion Patient Notes