Diagnostic Test Accuracy Meta-Analysis: A Practical Guide to Hierarchical Models

Javier Arredondo Montero

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background: Accurate evaluation of diagnostic tests is essential to guide clinical decision-making, particularly in surgical practice. Systematic reviews and meta-analyses of diagnostic test accuracy (DTA) are key for evidence synthesis; however, traditional approaches, including univariate pooling or simplified summary ROC (SROC) models such as the Moses–Littenberg method, often yield biased and clinically misleading estimates.Methods: This article presents a methodological guide to hierarchical random-effects models for DTA meta-analysis, structured around current evidence and best practices. Based on this framework, a simulated dataset was generated, and a comprehensive meta-analysis was performed. The analysis illustrates key methodological concepts, interpretation of model outputs, and the use of complementary tools, including likelihood ratios, scattergrams, meta-regression, publication bias assessment, and outlier detection. It also provides a critical comparison of Stata commands for DTA meta-analysis (metandi, midas, metadta), outlining their methodological strengths and limitations to guide researchers in tool selectionResults: The traditional meta-analysis, performed with Meta-DiSc 1.4, applied the DerSimonian–Laird and Moses–Littenberg methods, produced separate sensitivity and specificity pooled estimates with artificially narrow confidence intervals and a symmetric, theoretical SROC curve extrapolated beyond the observed data range, thereby ignoring threshold variability and underestimating between-study heterogeneity. In contrast, the hierarchical random-effects model provided more realistic and clinically interpretable estimates. Joint modeling of sensitivity and specificity revealed substantial between-study variability, a strong negative correlation consistent with a threshold effect, an elliptical confidence region around the summary point (reflecting uncertainty in mean sensitivity/specificity), together with a broader prediction region indicating where 95% of future studies might fall. Influence diagnostics identified outliers and highly influential studies. Conclusions: Promoting the correct application and interpretation of hierarchical models in DTA meta-analyses is essential to ensure high-quality, reliable, and scientifically robust evidence.

Version published to 10.20944/preprints202506.2461.v2
Sep 5, 2025
Version published to 10.20944/preprints202506.2461.v1
Jun 30, 2025

The State of the Art in Meta-Analysis Software: Evolution, Shortcomings, and Future Directions

This article has 1 author:
1. Javier Arredondo Montero
This article has no evaluationsLatest version Aug 20, 2025
Predictive Performance Precision Analysis in Medicine: Identification of low-confidence predictions at patient and profile levels (MED3pa I)

This article has 7 authors:
1. Olivier Lefebvre
2. Félix Camirand Lemyre
3. Jean-François Ethier
4. Lyna Hiba Chikouche
5. Ludmila Amriou
6. Dan Poenaru
7. Martin Vallìeres
This article has no evaluationsLatest version Aug 26, 2025
Interpreting Heterogeneity in Meta-Analysis: A Unified Framework Across Intervention, Diagnostic, and Prognostic Reviews

This article has 1 author:
1. Javier Arredondo Montero
This article has no evaluationsLatest version Sep 2, 2025

Listed in

Abstract

Article activity feed

Related articles

The State of the Art in Meta-Analysis Software: Evolution, Shortcomings, and Future Directions

Predictive Performance Precision Analysis in Medicine: Identification of low-confidence predictions at patient and profile levels (MED3pa I)

Interpreting Heterogeneity in Meta-Analysis: A Unified Framework Across Intervention, Diagnostic, and Prognostic Reviews