MAP-CARE: Enhancing Cross-Lingual Medical Intervention Terms Analysis Through LLM-supported Semantic Embeddings
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background: Cross-lingual information retrieval limits global exchange of data because of the high diversity in the methods to classify, document and encode medical procedures. Traditional keyword-based or single-language systems are not able to align data from surgical and interventional procedures, especially from non-English healthcare systems. This study aims to develop a pipeline for cross-lingual retrieval and integration of medical procedures data. Results: MAP-CARE is a novel framework that leverages Large Language Models (LLMs) for translating and transforming medical procedures into a unified multilingual embedding space. S emantic embeddings are used to enhance retrieval accuracy and interoperability across languages and healthcare systems. MAP-CARE demonstrated high accuracy in the translation and mapping of clinical terms. Its cross-language translation performance proved robust, achieving up to 90% accuracy in translating procedure classification codes across English, German, French, and Italian—when considering the correct term among the top five retrieved. The cross-classification mapping workflow also showed high accuracy in aligning two different national procedure classifications, with exact and near matches exceeding 53.8% at the most granular level. Conclusion: MAP-CARE offers a flexible, scalable, and robust solution for the multilingual and cross-system integration of medical procedural data. Its innovative use of large language models (LLMs) combined with semantic embeddings sets a new standard for the accessibility and utility of multilingual medical information. The framework is designed for easy extension from a terminology file in CSV format and is publicly available [1].