Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation

Johnson Thomas
Ayush Mudgal
Wendao Liu
Nisten Tahiraj
Zeeshaan Mohammed
Dhruv Diddi

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Clinical documentation represents a significant burden for healthcare providers, with physicians spending up to 2 hours daily on administrative tasks. Recent advances in large language models (LLMs) offer promising solutions, but privacy concerns and computational requirements limit their adoption in healthcare settings.

Objective

To develop and evaluate a privacy-preserving, on-device medical transcription system using a fine-tuned Llama 3.2 1B model capable of generating structured medical notes from medical transcriptions while maintaining complete data sovereignty entirely in the browser.

Methods

We fine-tuned a Llama 3.2 1B model using Parameter-Efficient Fine-Tuning (PEFT) with LoRA on 1,500 synthetic medical transcription-to-structured note pairs. The model was evaluated against the base Llama 3.2 1B on two datasets: 100 endocrinology transcripts and 140 modified ACI benchmark cases. Evaluation employed both statistical metrics (ROUGE, BERTScore, BLEURT) and LLM-as-judge assessments across multiple clinical quality dimensions.

Results

The fine-tuned OnDevice model demonstrated substantial improvements over the base model. On the ACI benchmark, ROUGE-1 scores increased from 0.346 to 0.496, while BERTScore F1 improved from 0.832 to 0.866. Clinical quality assessments showed marked reduction in major hallucinations (from 85 to 35 cases) and enhanced factual correctness (2.81 to 3.54 on 5-point scale). Similar improvements were observed on the internal evaluation dataset, with composite scores increasing from 3.13 to 4.43 (+41.5%).

Conclusions

Fine-tuning compact LLMs for medical transcription yields clinically meaningful improvements while enabling complete on-device browser deployment. This approach addresses key barriers to AI adoption in healthcare: privacy preservation, cost reduction, and accessibility for resource-constrained environments.

Version published to 10.1101/2025.07.01.25330679 on medRxiv
Jul 2, 2025

Privacy Protection for Chinese Electronic Medical Records Using Large Language Models: Effectiveness Evaluation and Application of LLM Models in Medical Data Tasks

This article has 11 authors:
1. Gong Mengchun
2. Ouyang Zihao
3. Ma Dandan
4. Cai Endi
5. Liu Chao
6. Shi Wenzhao
7. Zhang Bohan
8. Ma Lian
9. Wei Yuna
10. Jiang Huizhen
11. Zhou Xiang
This article has no evaluationsLatest version Jul 28, 2025
Enhancing Privacy-Preserving Deployable Large Language Models for Perioperative Complication Detection: A Targeted Strategy with LoRA Fine-tuning

This article has 10 authors:
1. Shaowei Gao
2. Xu Zhao
3. Lihui Chen
4. Junrong Yu
5. shuning Tian
6. Huaqiang Zhou
7. jingru Chen
8. Sizhe Long
9. Qiulan He
10. Xia Feng
This article has no evaluationsLatest version Jun 13, 2025
SPELL-LLMs: A Scalable and Privacy-Compliant NLP Pipeline Using Locally Hosted Large Language Models for Clinical Information Extraction

This article has 4 authors:
1. Ricardo Kleinlein
2. Kathryn J. Gray
3. David Bates
4. Vesela P. Kovacheva
This article has no evaluationsLatest version Jul 25, 2025

Listed in

Abstract

Background

Objective

Methods

Results

Conclusions

Article activity feed

Related articles

Privacy Protection for Chinese Electronic Medical Records Using Large Language Models: Effectiveness Evaluation and Application of LLM Models in Medical Data Tasks

Enhancing Privacy-Preserving Deployable Large Language Models for Perioperative Complication Detection: A Targeted Strategy with LoRA Fine-tuning

SPELL-LLMs: A Scalable and Privacy-Compliant NLP Pipeline Using Locally Hosted Large Language Models for Clinical Information Extraction