Dharma: A novel, clinically grounded machine learning framework for pediatric appendicitis—diagnosis, severity assessment and evidence-based clinical decision support

Anup Thapa Kshetri
Subash Pahari
Shashank Timilsina
Binay Chapagain

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Acute appendicitis is a common but diagnostically challenging surgical emergency in children. Existing linear scoring systems lack sufficient accuracy for standalone use, while advanced imaging is constrained by risks of sedation, contrast, and radiation. Furthermore, no available tools provide prognostic guidance.

We introduce Dharma , a machine learning framework consisting of a clinically grounded imputer and two random forest classifiers for diagnosis and severity assessment. Designed for real-world bedside use, Dharma is open-sourced and accessible through a web application.

Dharma achieved excellent diagnostic performance, with an AUC-ROC of 0.98 [0.97–0.99] and accuracy of 93% [91–95]. For prognostic classification, it identified complicated appendicitis with high sensitivity (96% [93–99]) and negative predictive value (97% [94–99]). In cases without appendix visualization—a frequent limitation in resource-constrained settings—Dharma maintained strong performance (AUC-ROC 0.96 [0.93–0.99]), with specificity of 97% [93–c100] and PPV of 93% [84–c100] at a 44% threshold, and sensitivity of 92% [84–98] with NPV of 95% [91–99] at a 25% threshold. These threshold-dependent trade-offs enable Dharma to support both ruling in and ruling out appendicitis within diverse clinical workflows.

Beyond pediatric appendicitis, Dharma’s open-source framework and clinically grounded design provide a generalizable foundation for developing equitable and practical decision-support tools in healthcare.

Author Summary

Accurate diagnosis and risk stratification of pediatric appendicitis remain challenging due to heterogeneous clinical presentations, the absence of definitive biomarkers, and limited access to reliable imaging, particularly in resource-constrained settings. Existing clinical scoring systems, while simple and widely used, rely on fixed linear assumptions, lack interpretability at the individual patient level, and are restricted to diagnostic decision-making without addressing disease severity or prognosis.

To address these limitations, we developed Dharma , a clinically grounded, interpretable machine learning-based framework designed to support both diagnosis and severity assessment of pediatric appendicitis using routinely available clinical, laboratory, and radiological data. Rather than replacing clinical judgment, Dharma is intentionally designed to mirror real-world clinical reasoning, explicitly model non-linear interactions among features, and provide transparent explanations for its predictions.

Implemented as an open-access, web-based clinical decision support tool, Dharma delivers real-time, evidence-based risk estimates that are adaptable to varying resource settings. Beyond pediatric appendicitis, this work demonstrates a generalizable, medicine-first approach to developing equitable and interpretable AI systems that complement clinician decision-making and align with the evolving needs of 21st-century healthcare.

Version published to 10.1101/2025.05.27.25328468 on medRxiv
May 29, 2025

Conceptual Clinical Variables Enhancing the Alvarado Score in Pediatric Appendicitis: Lessons for Artificial Intelligence Models

This article has 1 author:
1. PORNSRI THANACHATCHAIRATTANA
This article has no evaluationsLatest version Feb 3, 2026
Prospective Multicenter External Validation of the BIDIAP Index for the Diagnosis of Pediatric Acute Appendicitis

This article has 14 authors:
1. Javier Arredondo Montero
2. Andrea Herreras Martínez
3. Luis Rello Varas
4. Alicia Escudero Villafañe
5. Marina Iglesias Oricheta
6. Maria del Mar Larrea Ortiz-Quintana
7. Lucía Fernández Rodríguez
8. Pablo Aguado Roncero
9. Maria Carmen Campos Calleja
10. Ricardo Díez
11. Samuel Sáez Álvarez
12. Carmen Ruiz de la Cuesta Martín
13. Carlos Delgado Miguel
14. Rafael Fernández Atuan
This article has no evaluationsLatest version Jan 16, 2026
Prospective Multicenter External Validation of the BIDIAP Index for the Diagnosis of Pediatric Acute Appendicitis

This article has 14 authors:
1. Javier Arredondo Montero
2. Andrea Herreras Martínez
3. Luis Rello Varas
4. Alicia Escudero Villafañe
5. Marina Iglesias Oricheta
6. Maria del Mar Larrea Ortiz-Quintana
7. Lucía Fernández Rodríguez
8. Pablo Aguado Roncero
9. Maria Carmen Campos Calleja
10. Ricardo Díez
11. Samuel Sáez Álvarez
12. Carmen Ruiz de la Cuesta Martín
13. Carlos Delgado Miguel
14. Rafael Fernández Atuan
This article has no evaluationsLatest version Jan 16, 2026

Discuss this preprint

Listed in

Abstract

Author Summary

Article activity feed

Related articles

Conceptual Clinical Variables Enhancing the Alvarado Score in Pediatric Appendicitis: Lessons for Artificial Intelligence Models

Prospective Multicenter External Validation of the BIDIAP Index for the Diagnosis of Pediatric Acute Appendicitis

Prospective Multicenter External Validation of the BIDIAP Index for the Diagnosis of Pediatric Acute Appendicitis