Evaluating the Efficacy of ChatGPT in Generating Assessment and Plans for Medical Notes in Urology
Abstract
Purpose: The increase in medical documentation responsibilities has been implicated in rising burnout rates among urologists. AI chatbots such as ChatGPT may help reduce the documentation workload of urologists. This study evaluates the quality of assessment and plans created by ChatGPT compared with those created by residents.

Methods: Eleven fictional cases were submitted to ChatGPT-4 and to four residents at the University of Illinois, with instructions to create an assessment and plan for each scenario. The responses were given to two attending physicians, who graded them for accuracy, clarity, and clinical reasoning using Likert-type scales. The graders noted whether false information was present and recorded the perceived identity of each response's author. The Mann-Whitney U test was used for statistical analysis of the Likert-type data.

Results: Compared with responses created by residents, ChatGPT received significantly higher scores for clarity, comprehensiveness, and soundness of clinical reasoning. There was no significant difference between the two groups in accuracy of diagnosis or accuracy of the treatment plan. The evaluators misidentified the author's identity in 13.64% of responses.

Conclusion: This study compared the abilities of ChatGPT and residents in generating assessment and plans for clinical encounters. The results indicate that ChatGPT is superior to residents in the tested domains.