Privacy-PreservingLLM Middleware in LIS: Edge-Computing for Coagulation InterpretationUnder High-Dimensional Noise

Zihao Zhou

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background: The integration of clinical decision support into primary careLaboratory Information Systems (LIS) is severely hindered by strictprivacy regulations and limited computing infrastructure. We propose aprivacy-preserving, edge-computing Large Language Model (LLM)middleware to interpret pre-analytical coagulation tests, addressingthe clinical constraints of data compliance, hardware poverty, andzero-tolerance for AI hallucinations. Methods: We developed a local Retrieval-Augmented Generation (RAG) architectureusing a 4-bit quantized Qwen2.5-7B model deployed via the Ollamaframework on a commercial 32\,GB RAM terminal without a dedicated GPU.To rigorously validate the system, we generated a 2{,}000-casesynthetic baseline of APTT tests and injected 30% semantic and 15%lexical noise to simulate real-world LIS inputs. Output safety wasstrictly enforced utilising Pydantic V2 schemas and a Tenacity-drivenself-reflection mechanism (maximum 3 retries). Results: The proposed LLM-RAG middleware demonstrated exceptional robustness,maintaining a Guideline Concordance Accuracy (\((ACC_{gc})\)) of 97.00%under high-dimensional noise, whereas the traditional rule-basedbaseline collapsed to 10.00%. The system successfully constrainedhallucinations, recording a Critical Violation Rate (CVR) of only1.50%. Clinical and hardware viability were proven with a throughputof 1.45\,RPS and an average latency of 1.38\,seconds (strictly belowthe 3.0\,s threshold), while achieving a Cohen's Kappa score of 0.88in an independent AI-as-Expert adjudication protocol. Conclusion: This edge-deployed LLM paradigm provides a highly reliable,zero-marginal-cost alternative for LIS modernisation, enabling secureand robust AI integration in resource-constrained medical environments.

Version published to 10.21203/rs.3.rs-9171815/v1 on Research Square
Mar 23, 2026

Privacy-Preserving Retrieval for Auditable Clinical Language Modeling on Real-World Radiology Data

This article has 3 authors:
1. Nuri Purswani
2. Viktor Schlegel
3. Anil Anthony Bharath
This article has no evaluationsLatest version Mar 8, 2026
Uncertainty Aware Llm Deidentification and Anonymization of Clinical Notes

This article has 5 authors:
1. Adrienne Kline
2. Nafiseh Ghaffar Nia
3. Tyler J. Smith
4. David Leibowitz
5. Douglas Johnston
This article has no evaluationsLatest version Mar 19, 2026
The Kernel Blindness Hypothesis: Investigating OS-Level Detectability of LLM Safety Mechanisms

This article has 2 authors:
1. Ata Kilic
2. Baris Celiktas
This article has no evaluationsLatest version Mar 24, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Privacy-Preserving Retrieval for Auditable Clinical Language Modeling on Real-World Radiology Data

Uncertainty Aware Llm Deidentification and Anonymization of Clinical Notes

The Kernel Blindness Hypothesis: Investigating OS-Level Detectability of LLM Safety Mechanisms