Reliability, Accuracy, and Comprehensibility of AI-Based Responses to Common Patient Questions Regarding Spinal Cord Stimulation
Abstract
Background: Although spinal cord stimulation (SCS) is an effective treatment for managing chronic pain, many patients have understandable questions and concerns regarding this therapy. Artificial intelligence (AI) has shown promise in delivering patient education in healthcare. This study evaluates the reliability, accuracy, and comprehensibility of ChatGPT’s responses to common patient inquiries about SCS.

Methods: Thirteen commonly asked questions regarding SCS were selected based on the authors’ clinical experience managing chronic pain patients and a targeted review of patient education materials and relevant medical literature. Questions were prioritized by their frequency in patient consultations, their relevance to decision-making about SCS, and the complexity of the information typically required to address them comprehensively. The questions spanned three domains: pre-procedural, intra-procedural, and post-procedural concerns. Responses were generated with GPT-4.0 using the prompt “If you were a physician, how would you answer a patient asking…”. Ten pain physicians and two non-healthcare professionals independently rated each response on Likert scales for reliability (1–6 points), accuracy (1–3 points), and comprehensibility (1–3 points).

Results: ChatGPT’s responses demonstrated strong reliability (5.1 ± 0.7) and comprehensibility (2.8 ± 0.2), with 92% and 98% of responses, respectively, meeting or exceeding the predefined thresholds. Mean accuracy was 2.7 ± 0.3, with 95% of responses rated sufficiently accurate. General queries, such as “What is spinal cord stimulation?” and “What are the risks and benefits?”, received higher scores than technical questions such as “What are the different types of waveforms used in SCS?”.

Conclusions: ChatGPT can serve as a supplementary tool for patient education, particularly for general and procedural queries about SCS. However, its performance was less robust on highly technical or nuanced questions.
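For readers wishing to reproduce the response-generation step, the sketch below shows one plausible way to apply the study’s prompt template programmatically. The abstract does not specify whether the authors used the ChatGPT web interface or the API, so this is a minimal illustration assuming the OpenAI Python SDK; the model identifier, question subset, and function name are assumptions, not details taken from the study.

```python
# Minimal sketch of the study's generation step, assuming the OpenAI
# Python SDK. Whether the authors used the web UI or the API is not
# stated; model name, question subset, and helper name are illustrative.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

PROMPT_TEMPLATE = (
    'If you were a physician, how would you answer a patient asking "{question}"'
)

# Three of the thirteen study questions quoted in the abstract
# (illustrative subset only).
questions = [
    "What is spinal cord stimulation?",
    "What are the risks and benefits?",
    "What are the different types of waveforms used in SCS?",
]

def generate_response(question: str) -> str:
    """Return one model response to a single patient question."""
    completion = client.chat.completions.create(
        model="gpt-4",  # stand-in for the GPT-4.0 model named in the abstract
        messages=[
            {"role": "user", "content": PROMPT_TEMPLATE.format(question=question)}
        ],
    )
    return completion.choices[0].message.content

for q in questions:
    print(f"Q: {q}\nA: {generate_response(q)[:200]}...\n")
```

Responses produced this way would then be rated independently on the reliability, accuracy, and comprehensibility Likert scales described above.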