Systematic Review of Large Language Models for Patient Care: Current Applications and Challenges

Abstract

The introduction of large language models (LLMs) into clinical practice promises to improve patient education and empowerment, thereby personalizing medical care and broadening access to medical knowledge. Despite the popularity of LLMs, there is a significant gap in systematized information on their use in patient care. This systematic review therefore synthesizes current applications and limitations of LLMs in patient care using a data-driven convergent synthesis approach. We searched 5 databases for qualitative, quantitative, and mixed-methods articles on LLMs in patient care published between 2022 and 2023. From 4,349 initial records, 89 studies across 29 medical specialties were included. The studies primarily examined models based on the GPT-3.5 (53.2%, n=66 of 124 LLMs examined across studies) and GPT-4 (26.6%, n=33/124) architectures, most commonly for medical question answering, followed by generation of patient information, including medical text summarization or translation, and clinical documentation. Our analysis delineates two primary domains of LLM limitations: design and output. Design limitations comprised 6 second-order and 12 third-order codes, such as lack of medical domain optimization, data transparency, and accessibility issues, while output limitations comprised 9 second-order and 32 third-order codes, for example, non-reproducibility, non-comprehensiveness, incorrectness, unsafety, and bias. In conclusion, this is the first review to systematically map LLM applications and limitations in patient care, providing a foundational framework and taxonomy for their implementation and evaluation in healthcare settings.