AI Chatbots Versus Human Healthcare Professionals: A Systematic Review and Meta-Analysis of Empathy in Patient Care
Abstract
Background
Empathy is widely recognised as improving patient outcomes, from reduced pain and anxiety to greater patient satisfaction, and its absence can cause harm. Meanwhile, the use of artificial intelligence (AI)–based chatbots in healthcare is expanding rapidly, with one in five general practitioners (GPs) reporting use of generative AI to assist with tasks such as writing letters. Several studies suggest that these AI-based technologies are sometimes rated as more empathic than human healthcare professionals (HCPs). However, the evidence in this area is mixed and has not been synthesised.
Objective
To conduct a systematic review and meta-analysis of studies comparing the empathy of AI chatbots with that of human HCPs.
Methods
We searched multiple databases for studies comparing AI chatbots using large language models (e.g., GPT-3.5, GPT-4) with human HCPs on empathy measures. We assessed risk of bias with ROBINS-I and synthesised findings using random-effects meta-analysis where feasible, whilst avoiding double counting.
Results
Our search identified 15 studies (2023–2024). Thirteen studies reported statistically significantly higher empathy ratings for AI, while the remaining two, both in dermatology, favoured human responses. Meta-analysis of the 13 studies with data suitable for pooling, all using ChatGPT-3.5 or ChatGPT-4, showed a standardised mean difference (SMD) of 0.87 (95% CI, 0.54–1.20) favouring AI (p<0.00001).
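As an illustration of how a pooled SMD and its confidence interval are typically obtained, the sketch below implements the standard DerSimonian–Laird random-effects model. The per-study effect sizes and variances here are hypothetical placeholders, not the data of the 13 pooled studies:

```python
import math

def dersimonian_laird(effects, variances):
    """Pool standardised mean differences under a DerSimonian-Laird
    random-effects model. Inputs are per-study SMDs and their variances."""
    k = len(effects)
    w = [1.0 / v for v in variances]                       # fixed-effect weights
    y_fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    q = sum(wi * (yi - y_fixed) ** 2 for wi, yi in zip(w, effects))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)                     # between-study variance
    w_star = [1.0 / (v + tau2) for v in variances]         # random-effects weights
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, (pooled - 1.96 * se, pooled + 1.96 * se)

# Hypothetical example: three studies with SMDs and variances
smd, ci = dersimonian_laird([0.9, 0.8, 1.0], [0.04, 0.05, 0.06])
```

When the heterogeneity statistic Q does not exceed its degrees of freedom, the between-study variance estimate is truncated at zero and the model reduces to a fixed-effect analysis.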
Conclusion
Our findings indicate that, in text-only scenarios, AI chatbots are frequently perceived as more empathic than human HCPs, a difference equivalent to roughly two points on a 10-point empathy scale. Future research should validate these findings with direct patient evaluations and assess whether emerging voice-enabled AI systems can deliver similar empathic advantages.