Feeding Intelligence: Comparative Evaluation of ChatGPT and Clinical Guidelines for Nutritional Management in Head and Neck Cancer
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background Artificial intelligence (AI) tools such as ChatGPT are increasingly applied in digital health and patient education, yet their alignment with established clinical guidelines for cancer-related nutritional management remains unclear. Objective This study aimed to evaluate the concordance, functional characteristics, patient accessibility, and innovation of ChatGPT-generated nutritional recommendations compared with clinical guidelines from the Chinese Society of Clinical Oncology (CSCO), Chinese Nutrition Society (CNS), and European Society for Clinical Nutrition and Metabolism (ESPEN). Methods We analyzed ChatGPT responses across six key nutrition-related issues—anorexia/cachexia, dysphagia, oral mucositis, unintentional weight loss, gastrointestinal intolerance, and nutritional monitoring—and compared them with guideline recommendations. Expert evaluation (n = 5), readability metrics, semantic similarity (TF-IDF), and patient-centered assessments were conducted to compare personalization, innovation, clinical feasibility, evidence-based support, population applicability, clarity, and self-management guidance. Results ChatGPT recommendations aligned with at least one guideline in 50.0–64.3% of cases, highest for dysphagia (64.3%), and included general strategies such as small frequent meals, texture modification, hydration, and high-protein/high-calorie intake. ChatGPT-specific suggestions (8.3–18.2%) focused on lifestyle and behavioral interventions, including mindful eating, music therapy, and wearable diet trackers. Expert ratings indicated higher personalization (4.3/5) and innovation (4.6/5) for ChatGPT, whereas guidelines scored higher for clinical feasibility (4.7/5), evidence-based support (4.9/5), and population applicability (4.8/5). ChatGPT exhibited superior patient-centered performance in clarity (4.5 vs 3.2) and self-management guidance (4.6 vs 3.0) and demonstrated more concise, readable content (Flesch–Kincaid grade 12.9–14.2) compared with guidelines (17.9–20.5). Semantic analysis revealed moderate overlap with CSCO (≈ 0.63) and CNS (≈ 0.59), and lower similarity with ESPEN (≈ 0.47), highlighting ChatGPT’s use of patient-friendly language. Topic modeling identified three clusters: patient support and accessibility (ChatGPT), technical nutrition therapy (ESPEN/CSCO), and nutritional assessment and monitoring (CNS). Conclusions ChatGPT provides personalized, innovative, and patient-accessible nutritional guidance for cancer-related malnutrition, complementing traditional clinical guidelines. While guidelines remain essential for evidence-based decision-making, AI tools may enhance patient education, engagement, and self-management in digital health applications.