Closing the Paediatric Gap: Adult-Trained AI Generalises Robustly to Paediatric Coeliac Disease Diagnosis

Florian Jaeckle
Peter M. Gillett
Kathryn J. Kirkwood
Shonali Natu
James Y. H. Chan
Adrian C. Bateman
Mark J. Arends
Elizabeth J. Soilleux

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Coeliac disease (CD) diagnosis on duodenal biopsies is limited by interobserver variability. We have previously demonstrated pathologist-level performance with our artificial intelligence (AI) model for the histopathological diagnosis of adult CD, but not in paediatric practice. As paediatric CD screening programmes expand internationally, accurate and scalable diagnostic tools are needed. We investigated whether an AI model trained exclusively on adult whole-slide images (WSIs) can generalise to paediatric CD diagnosis across independent centres.

Methods

A training and validation dataset of 9,958 WSIs from 8,421 adult patients (961 CD) from five centres was used to develop an ensemble of multiple-instance learning models using features from a foundation model. Testing was performed on 708 consecutive paediatric patients (86 CD) from two centres (Edinburgh and Southampton) not included in training. Model calibration was assessed, and probability outputs were grouped into clinically interpretable categories.

Findings

In adult cross-validation, the AI model achieved an area under the receiver operating characteristic curve (AUC) of 98.7%, sensitivity of 84.9%, specificity of 99.0%, and negative predictive value (NPV) of 98.1%. On testing (paediatric) datasets, performance remained high (AUC 98.8%, sensitivity 80.2%, specificity 98.4%, NPV 97.3%). Restricting analysis to predictions outside the intermediate-probability range (predicted CD probability <10% or ≥65%; 85.3% of cases) improved sensitivity to 100% and specificity to 98.7%. No misclassifications were observed among high-confidence predictions (<2% or ≥85%; 66.0% of cases). The expected calibration error was 0.03. Performance improved significantly when biopsies from both duodenal sites (bulb [D1] and descending [D2/3]) were considered.

Interpretation

Our AI model, trained on adult biopsies, generalises to paediatric CD diagnosis across centres and scanner platforms. Well-calibrated probability outputs provide clinically interpretable measures of diagnostic confidence and could support safe identification of CD-negative biopsies within defined thresholds. These findings demonstrate the feasibility of applying adult-derived AI models in paediatric populations and reinforce the importance of multi-site (D1 & D2) biopsy sampling.

Version published to 10.64898/2026.06.04.26354889 on medRxiv
Jun 5, 2026

Interpretable machine learning for coeliac disease diagnosis: quantitative morphometry of duodenal biopsies

This article has 10 authors:
1. Rebekah Bryant
2. Jacobo Romero Diaz
3. Adam G. Scott
4. Aryan A. Sagdeo
5. Gabriella Z. Jenkins
6. Robert A. Richardson
7. James Y. H. Chan
8. Mark J. Arends
9. Elizabeth J. Soilleux
10. Florian Jaeckle
This article has no evaluationsLatest version Jun 3, 2026
Optimisation of steatotic liver disease screening algorithm for resource-poor settings using machine learning

This article has 7 authors:
1. Chamila Mettananda
2. Kaveesha Sivasumithran
3. Ranaweera L Lakmali
4. Anjalika Madhubhashini
5. Chamila Ranawaka
6. Arunasalam Pathmeswaran
7. Anuradha Dassanayake
This article has no evaluationsLatest version Jun 10, 2026
Validation of a Paediatric-Optimized Computer-Aided Detection System for Tuberculosis Using Bayesian Latent Class Analysis

This article has 10 authors:
1. Victory F. Edem
2. Schadrac C. Agbla
3. Esin Nkereuwem
4. Sheila A. Owusu
5. Nuredin Mohammed
6. Abdou K. Sillah
7. Omolola M. Atalabi
8. Uzochukwu Egere
9. Beate Kampmann
10. Toyin Togun
This article has no evaluationsLatest version May 20, 2026

Discuss this preprint

Listed in

Abstract

Methods

Findings

Interpretation

Article activity feed

Related articles

Interpretable machine learning for coeliac disease diagnosis: quantitative morphometry of duodenal biopsies

Optimisation of steatotic liver disease screening algorithm for resource-poor settings using machine learning

Validation of a Paediatric-Optimized Computer-Aided Detection System for Tuberculosis Using Bayesian Latent Class Analysis