Can Artificial Intelligence Evaluate Online Health Information? A Comparative Assessment of Scabies-Related YouTube Videos Using Human Experts and ChatGPT-5
Abstract
Scabies is a highly contagious parasitic skin disease for which patients frequently seek information on YouTube, although the reliability of available content is uncertain. Artificial intelligence (AI), particularly large language models, has emerged as a potential tool for assessing online health information; however, its concordance with expert evaluation remains unclear. This cross-sectional study analyzed the first 50 English-language YouTube videos retrieved using the term “scabies disease.” Two dermatology specialists independently evaluated videos using the DISCERN instrument, JAMA benchmark criteria, and the Global Quality Scale (GQS). Video characteristics were recorded, and sources were classified as professional or non-professional. Corrected transcripts were analyzed with ChatGPT-5 to generate Accuracy and Completeness scores. Readability was assessed using Flesch Reading Ease and Flesch–Kincaid Grade Level. Professionally produced videos scored significantly higher than non-professional videos across all human-based quality measures (p < 0.01). AI-generated scores were also higher for professional content but showed only moderate correlation with expert assessments. ChatGPT Completeness demonstrated moderate discrimination in identifying higher-quality videos (AUC = 0.668). Overall, AI reflected general quality trends but did not replicate expert judgment, suggesting a complementary rather than substitutive role.
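The two readability metrics named in the abstract are standard closed-form formulas over word, sentence, and syllable counts. As an illustrative sketch only (not the authors' tooling), the snippet below computes both using a naive vowel-group syllable heuristic; the helper names and the heuristic are assumptions for demonstration.

```python
import re

def count_syllables(word):
    # Naive heuristic: count vowel groups; drop a trailing silent "e".
    # Not dictionary-accurate -- real studies typically use a lexicon-based counter.
    groups = re.findall(r"[aeiouy]+", word.lower())
    n = len(groups)
    if word.lower().endswith("e") and n > 1:
        n -= 1
    return max(n, 1)

def flesch_metrics(text):
    # Split into sentences and words with simple regexes (an assumption;
    # production code would use a proper tokenizer).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    # Flesch Reading Ease: higher = easier to read.
    fre = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    # Flesch-Kincaid Grade Level: approximate U.S. school grade.
    fkgl = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return fre, fkgl
```

Applied to a corrected video transcript, these scores let readability be compared across professional and non-professional sources alongside the quality instruments.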