Clinically grounded multi-agent artificial intelligence for preventive health management

Hao Lin
Yang Zhang
Dongxin Ye
Sicheng He
Zhaowu Du
Yang Yu
Xiao Yu
Liping Ren
Nanqing Dong
Fang Hu
Jinsong Su
Jie Zhang
Yun Tan
Li Zhao

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Routine health examinations generate dense, heterogeneous data, yet their preventive value depends on consistent interpretation, calibrated risk stratification, and actionable follow-up. In practice, these tasks are distributed across clinicians and time, leading to variability in the detection of subtle abnormalities and in decisions about when and how to intervene. Such variability reflects the difficulty of maintaining consistent, high-quality preventive decision-making at scale. Here we present G-Health, a clinically grounded multi-agent artificial intelligence framework that translates examination reports into structured preventive action. The system combines three-stage clinical alignment of large language models with specialist quantitative risk models and guideline-informed retrieval to stabilize reasoning under uncertainty. Trained on large-scale medical dialogue data and further specialized on multi-center real-world examination reports, the framework integrates 20 quantitative risk models that provide calibrated multi-disease estimates with feature-level interpretability. Across 13 medical and general benchmarks, the aligned models achieve the best overall average rank among strong baselines. In a fully blinded evaluation involving 79 medically trained assessors, G-Health reports were consistently preferred over outputs from three other large language models and 12 senior practicing physicians across five clinical dimensions. Together, these findings establish a deployable paradigm that transforms routine examinations into structured and scalable preventive decision-making.

Version published to 10.21203/rs.3.rs-9150301/v1 on Research Square
Apr 9, 2026

The Inefficacy of Artificial Intelligence Large Language Models in Healthcare: A Clinical and Statistical Perspective

This article has 4 authors:
1. Michael Williams
2. Raeed Kabir
3. Cody Taylor
4. Tariq Nakhooda
This article has no evaluationsLatest version Apr 27, 2026
The Inefficacy of Artificial Intelligence Large Language Models in Healthcare: A Clinical and Statistical Perspective

This article has 4 authors:
1. Michael Williams
2. Raeed Kabir
3. Cody Taylor
4. Tariq Nakhooda
This article has no evaluationsLatest version Apr 27, 2026
Interpretable Predictive Modeling for Medical Data Using Boolean Rule-aware Regression

This article has 2 authors:
1. Mohammad Eskandarian
2. Seyed Amir Malekpour
This article has no evaluationsLatest version May 18, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

The Inefficacy of Artificial Intelligence Large Language Models in Healthcare: A Clinical and Statistical Perspective

The Inefficacy of Artificial Intelligence Large Language Models in Healthcare: A Clinical and Statistical Perspective

Interpretable Predictive Modeling for Medical Data Using Boolean Rule-aware Regression