A Clinician-Led Governance Framework for Evaluating Behavioral-Health AI Communication Safety
Abstract
Conversational artificial intelligence systems are increasingly used in behavioral-health contexts, where users frequently express emotional distress, uncertainty, and culturally nuanced needs. Although general AI governance frameworks provide high-level principles for responsible development, they do not offer domain-specific mechanisms for evaluating the safety and appropriateness of behavioral-health AI communication. This study introduces a clinician-led governance framework designed to evaluate communication-level safety in behavioral-health AI systems. The framework comprises ten interdependent governance domains constructed through clinical analysis, socio-technical review, and iterative conceptual refinement. Inter-rater reliability (IRR) procedures were conducted with trained behavioral-health reviewers scoring a standardized set of AI outputs across all domains. Agreement was consistently strong: Cohen's κ values ranged from substantial (κ = 0.64) to almost perfect (κ = 0.89) across domains. This work addresses an urgent oversight gap in AI governance by providing a structured, clinically informed method for evaluating the safety of behavioral-health AI communication.
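The abstract reports Cohen's κ as the agreement metric for pairs of reviewers. As an illustration of how such a statistic is computed, the sketch below implements the standard two-rater κ formula, κ = (p_o − p_e) / (1 − p_e), where p_o is observed agreement and p_e is chance agreement from the raters' marginal label frequencies. The ratings shown are hypothetical and are not drawn from the study's data.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labeling the same items."""
    assert len(rater_a) == len(rater_b) and rater_a, "ratings must align"
    n = len(rater_a)
    # Observed agreement: fraction of items both raters labeled identically
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = set(rater_a) | set(rater_b)
    p_e = sum((freq_a[lab] / n) * (freq_b[lab] / n) for lab in labels)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical binary safety ratings (1 = safe, 0 = unsafe) on ten outputs
a = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
b = [1, 1, 0, 1, 1, 1, 1, 0, 1, 0]
print(round(cohens_kappa(a, b), 2))  # → 0.52
```

Note that κ corrects raw percent agreement for chance: the two raters above agree on 80% of items, but because both label most items "safe," chance agreement is high and κ is considerably lower.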