Estimating Between-Trial Heterogeneity in Meta-Analyses Based on Two-Arm Clinical Trials with Outcomes Reported as Kaplan-Meier Curves

Andrea Messori
Vera Damuzzo
Andrea Ossato
Eugenia Piragine
Sara Veneziano
Vincenzo Calderone

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Between-trial heterogeneity is a key element in meta-analysis, traditionally quantified using the I² statistic in studies with binary outcomes. However, survival meta-analyses present additional challenges, as outcomes are usually reported through Kaplan–Meier curves and hazard ratios (HRs). Standard methods for heterogeneity estimation in this context remain poorly established, and consensus is lacking. Methods We propose a standardized approach for estimating between-trial heterogeneity in survival meta-analyses using I². The method is applicable both when individual patient data (IPD) are available (collaborative meta-analyses) and when IPD must be reconstructed from published Kaplan–Meier curves (IPDfromKM algorithm). To illustrate the approach, we re-analyzed a published meta-analysis of randomized controlled trials (RCTs) evaluating PARP inhibitor maintenance therapy in extensive-stage small-cell lung cancer. Five RCTs were included, and overall survival was the endpoint. Results The binary meta-analysis of crude survival rates yielded no significant heterogeneity (I² = 0%). By contrast, re-analysis based on reconstructed IPD and log-transformed HRs indicated moderate heterogeneity (I² = 36.3%, τ² = 0.0233, p = 0.179). Estimates of the overall treatment effect were similar between approaches (HR ≈ 1.03–1.04), though confidence intervals differed due to model specifications. Comparative evaluation with other methods (Wald test, likelihood ratio, concordance index) highlighted the unique interpretative advantages of I² in this setting. Discussion Our findings suggest that crude binary analyses may underestimate heterogeneity in survival meta-analyses. The I² statistic provides an intuitive and flexible measure of between-trial variability when survival data are expressed as HRs. While promising, this approach requires further validation across diverse clinical settings.

Version published to 10.20944/preprints202508.1816.v1
Aug 27, 2025

Methodological Analysis of Bias Risks in Adaptive Multi-Arm Platform Trials: A Case-Series from Three COVID-19 Studies

This article has 1 author:
1. Simon Reich
This article has no evaluationsLatest version Jan 6, 2026
Prognostic value of the REBECCA prognostic score and the Tabernero classification in refractory metastatic colorectal cancer treated with regorafenib: a multicenter real-world study

This article has 13 authors:
1. Martinez-Lago Nieves
2. Carnero Lopez Beatriz
3. de la Camara Gomez Juan
4. Vazquez Rivera Francisca
5. Fernandez Montes Ana
6. Cousillas Castiñeiras Antia
7. De Dios Alvarez Noemi
8. Covela Rua Marta
9. Varela Ponte Rafael
10. Agraso Busto Sara
11. Carral Maseda Alberto
12. Salgado Fernandez Mercedes
13. Reboredo Lopez Margarita
This article has no evaluationsLatest version Dec 22, 2025
Daratumumab or Elotuzumab Plus Lenalidomide and Dexamethasone in Relapsed or Refractory Multiple Myeloma: An Indirect Comparison Based on Reconstructing Individual Patient Data from Kaplan-Meier Overall Survival Curves Using an Artificial-Intelligence Algorithm

This article has 1 author:
1. Andrea Messori
This article has no evaluationsLatest version Dec 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Methodological Analysis of Bias Risks in Adaptive Multi-Arm Platform Trials: A Case-Series from Three COVID-19 Studies

Prognostic value of the REBECCA prognostic score and the Tabernero classification in refractory metastatic colorectal cancer treated with regorafenib: a multicenter real-world study

Daratumumab or Elotuzumab Plus Lenalidomide and Dexamethasone in Relapsed or Refractory Multiple Myeloma: An Indirect Comparison Based on Reconstructing Individual Patient Data from Kaplan-Meier Overall Survival Curves Using an Artificial-Intelligence Algorithm