MAP-CARE: Enhancing Cross-Lingual Medical Intervention Terms Analysis Through LLM-supported Semantic Embeddings

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background: Cross-lingual information retrieval limits global exchange of data because of the high diversity in the methods to classify, document and encode medical procedures. Traditional keyword-based or single-language systems are not able to align data from surgical and interventional procedures, especially from non-English healthcare systems. This study aims to develop a pipeline for cross-lingual retrieval and integration of medical procedures data. Results: MAP-CARE is a novel framework that leverages Large Language Models (LLMs) for translating and transforming medical procedures into a unified multilingual embedding space. S emantic embeddings are used to enhance retrieval accuracy and interoperability across languages and healthcare systems. MAP-CARE demonstrated high accuracy in the translation and mapping of clinical terms. Its cross-language translation performance proved robust, achieving up to 90% accuracy in translating procedure classification codes across English, German, French, and Italian—when considering the correct term among the top five retrieved. The cross-classification mapping workflow also showed high accuracy in aligning two different national procedure classifications, with exact and near matches exceeding 53.8% at the most granular level. Conclusion: MAP-CARE offers a flexible, scalable, and robust solution for the multilingual and cross-system integration of medical procedural data. Its innovative use of large language models (LLMs) combined with semantic embeddings sets a new standard for the accessibility and utility of multilingual medical information. The framework is designed for easy extension from a terminology file in CSV format and is publicly available [1].

Article activity feed