A systematic review and meta-analysis of diagnostic performance comparison between generative AI and physicians

Hirotaka Takita
Daijiro Kabata
Shannon L. Walston
Hiroyuki Tatekawa
Kenichi Saito
Yasushi Tsujimoto
Yukio Miki
Daiju Ueda

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

While generative artificial intelligence (AI) has shown potential in medical diagnostics, comprehensive evaluation of its diagnostic performance and comparison with physicians has not been extensively explored. We conducted a systematic review and meta-analysis of studies validating generative AI models for diagnostic tasks published between June 2018 and June 2024. Analysis of 83 studies revealed an overall diagnostic accuracy of 52.1%. No significant performance difference was found between AI models and physicians overall ( p = 0.10) or non-expert physicians ( p = 0.93). However, AI models performed significantly worse than expert physicians ( p = 0.007). Several models demonstrated slightly higher performance compared to non-experts, although the differences were not significant. Generative AI demonstrates promising diagnostic capabilities with accuracy varying by model. Although it has not yet achieved expert-level reliability, these findings suggest potential for enhancing healthcare delivery and medical education when implemented with appropriate understanding of its limitations.

Version published to 10.1038/s41746-025-01543-z
Mar 22, 2025
Version published to 10.1101/2024.01.20.24301563 on medRxiv
Jan 22, 2024

A Scoping Review of Generative AI in Mental Health Support

This article has 20 authors:
1. Richard Gaus
2. Felix Gross
3. Maxim Korman
4. Fiona Klaassen
5. Simona Maspero
6. Luca Martignoni
7. Maria F. Urquijo
8. Sabrina Boger
9. Tarek Jebrini
10. Johannes Wolf
11. Paul Hager
12. Elizabeth Cameron Stade
13. Yannik Terhorst
14. Jana Volkert
15. Joseph Kambeitz
16. Hans C. Stubbe
17. Frank Padberg
18. Shannon Wiltsey Stirman
19. Nikolaos Koutsouleris
20. johannes Christopher Eichstaedt
This article has no evaluationsLatest version Dec 16, 2025
A Scoping Review of Generative AI in Mental Health Support

This article has 20 authors:
1. Richard Gaus
2. Felix Gross
3. Maxim Korman
4. Fiona Klaassen
5. Simona Maspero
6. Luca Martignoni
7. Maria F. Urquijo
8. Sabrina Boger
9. Tarek Jebrini
10. Johannes Wolf
11. Paul Hager
12. Elizabeth Cameron Stade
13. Yannik Terhorst
14. Jana Volkert
15. Joseph Kambeitz
16. Hans C. Stubbe
17. Frank Padberg
18. Shannon Wiltsey Stirman
19. Nikolaos Koutsouleris
20. johannes Christopher Eichstaedt
This article has no evaluationsLatest version Dec 16, 2025
An Overview of Existing Applications of Artificial Intelligence in Histopathological Diagnostics of Lymphoma: A Scoping Review

This article has 7 authors:
1. Mieszko Czapliński
2. Grzegorz Redlarski
3. Mateusz Wieczorek
4. Paweł Kowalski
5. Piotr Mateusz Tojza
6. Adam Sikorski
7. Arkadiusz Żak
This article has no evaluationsLatest version Jan 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Scoping Review of Generative AI in Mental Health Support

A Scoping Review of Generative AI in Mental Health Support

An Overview of Existing Applications of Artificial Intelligence in Histopathological Diagnostics of Lymphoma: A Scoping Review