Variational biomarker pooling with calibration for time-to-event outcomes across multiple clinical studies

Jiali Song
Zhiwei Rong
Yan Hou

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Biomarkers are widely used in oncology research to study disease progression and predict survival outcomes. Pooling biomarker data across studies can improve precision, but pooled analyses are often affected by assay heterogeneity. Many studies use calibration designs because re-assaying all biospecimens on a reference platform is impractical. In pooled analyses, calibration can be incomplete when some studies have no reference measurements. This setting can be viewed as covariate measurement error in time-to-event models. Most existing methods were developed for single-cohort designs with validation or replicate measurements, and they do not directly accommodate study-specific calibration with incomplete reference data in pooled survival analyses. Methods In this paper, we propose Variational Inference-Based Biomarker Pooling (VIBP) for censored survival data. VIBP treats the target biomarker as latent and jointly models study-specific calibration and the survival outcome using parametric survival models. Variational inference provides scalable estimation, and uncertainty is quantified using bootstrap. Results Through extensive simulation studies for exponential and Weibull survival data, we find that VIBP consistently provides estimates with lower bias, smaller mean squared error, and near-nominal 95% coverage across a wide range of effect sizes and censoring rates. We further apply VIBP to a real-world dataset to evaluate the association between DJ-1 protein levels and overall survival in lung squamous cell carcinoma. The results highlight the ability of VIBP to recover meaningful survival associations under sparse and heterogeneous calibration information. Conclusions The proposed method provides accurate and robust estimation in the presence of inter-study variability and partially observed reference measurements, and it remains applicable even when some studies have no reference data.

Version published to 10.21203/rs.3.rs-8764450/v1 on Research Square
Feb 12, 2026

Generative AI-Based Imputation to Preserve Data Fidelity and Enhance Outcome Prediction: A Multi-Institutional Study in Cardiac Surgery

This article has 11 authors:
1. Negin Maddah
2. Amin Ramezani
3. Qingchu Jin
4. Jakob Wollborn
5. Akinobu Itoh
6. Jaime B. Rabb
7. Felistas Mazhude
8. Robert S. Kramer
9. Douglas B. Sawyer
10. Raimond L. Winslow
11. Farhad R. Nezami
This article has no evaluationsLatest version Jan 23, 2026
The Survival Double Descent: Generalization Dynamics of Deep Neural Networks in Time-to-Event Analysis

This article has 2 authors:
1. Steven Hart
2. Ann Oberg
This article has no evaluationsLatest version Mar 4, 2026
Bayesian Semiparametric Mixture Cure (Frailty) Models

This article has 2 authors:
1. Fatih Kızılaslan
2. Valeria Vitelli
This article has no evaluationsLatest version Feb 25, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Generative AI-Based Imputation to Preserve Data Fidelity and Enhance Outcome Prediction: A Multi-Institutional Study in Cardiac Surgery

The Survival Double Descent: Generalization Dynamics of Deep Neural Networks in Time-to-Event Analysis

Bayesian Semiparametric Mixture Cure (Frailty) Models