Impact of LLM Assistance on Physician Decision-Making: A Multi-Country Randomized Controlled Trial

Nicholas Rounding
Luthfi Saiful Arif
Janine Berg
Jochen Cals
Diederik De Boer
Eefje De Bont
Sander Dijksman
Ardi Findyartini
Didier Fouarge
Marie-Christine Fregin
Pawel Gmyrek
Nadia Greviana
Ralph Leijenaar
Soraiya Manji
Annastacia Mbithi
Norah Obungu
Arierta Pujitresnani
Roselyter Rianga
Diantha Soemantri
Sairabanu Mohamed Rashid Sokwalla
Sanne Steens
Lucia Velasco
Ardy Wildan
Prasandhya Astagiri Yusuf
Mark Levels

Abstract

Disparities in the quality of healthcare persist globally, with poor-quality care contributing significantly to preventable mortality, particularly in low- and middle-income countries. While digital technologies, including generative artificial intelligence (AI), hold promise for improving clinical decision-making, their global effectiveness and potential to mitigate cross-country variation remain underexplored. We conducted a parallel-group randomized controlled trial across three economically diverse countries—Indonesia, Kenya, and the Netherlands—to evaluate the impact of large language model (LLM) access on physician performance using standardized clinical vignettes. Physicians (N=249) were randomly assigned to either a control group or an intervention group with access to GPT-4o. Results showed that LLM access significantly improved clinical performance, with the largest effect in Kenya (18%, 95% CI: 12.7 to 23.2, p < 0.001), followed by Indonesia (10.7%, 95% CI: 5.7 to 15.7, p < 0.001) and the Netherlands (7.2%, 95% CI: 3.7 to 10.7, p < 0.001). Notably, LLM access reduced cross-country performance disparities, particularly between Kenya and the Netherlands. However, distributional effects varied, with increased score dispersion in Indonesia and reduced variation in Kenya. Higher LLM usage was associated with greater performance gains, though some physicians without access outperformed those with access, suggesting that effective use depends on individual engagement. Our findings demonstrate that LLMs can enhance clinical performance across diverse settings while potentially narrowing global inequalities in care quality. Further research should explore mechanisms of effective LLM integration and long-term impacts on real-world clinical practice.

Version published to 10.1101/2025.08.08.25333272 on medRxiv
Aug 12, 2025

