Prediction-powered Inference for Clinical Trials: application to linear covariate adjustment

Pierre-Emmanuel Poulet
Maylis Tran
Sophie Tezenas du Montcel
Bruno Dubois
Stanley Durrleman
Bruno Jedynak


Abstract

Prediction-powered inference (PPI) and its subsequent development, PPI++, provide a novel approach to standard statistical estimation that leverages machine learning systems to augment unlabeled data with predictions. We apply this paradigm to clinical trials. The predictions come from disease progression models, which produce prognostic scores for all participants as a function of baseline covariates. The proposed method would empower clinical trials by providing untreated digital twins of the treated patients while remaining statistically valid. The potential implications of this new estimator of the treatment effect in a two-arm randomized clinical trial (RCT) are manifold. First, it reduces the overall sample size required to reach the same power as a standard RCT. Second, it favors an unbalanced allocation between controls and treated patients, requiring fewer controls to achieve the same power. Third, it allows any disease prediction model trained on large cohorts to be transferred directly to practical and scientifically valid use. In this paper, we establish the theoretical properties of this estimator and illustrate them through simulations. We show that it is asymptotically unbiased for the Average Treatment Effect and derive an explicit formula for its variance. An application to an Alzheimer's disease clinical trial showcases the potential to reduce the sample size.
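
The abstract does not state the estimator's formula, so the following is only an illustrative sketch of the general idea, not the authors' estimator. It assumes a simplified form in which the raw difference in mean outcomes is corrected by the between-arm imbalance in prognostic scores, scaled by a single weight `lam`. The function names, the shared weight, the simulated data, and the 200/100 allocation are all assumptions made for illustration.

```python
import random
from statistics import fmean, variance


def covariance(a, b):
    """Unbiased sample covariance of two equal-length lists."""
    ma, mb = fmean(a), fmean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / (len(a) - 1)


def adjusted_effect(y_t, f_t, y_c, f_c):
    """Difference in means corrected by the imbalance in prognostic scores.

    y_t, y_c: outcomes in the treated and control arms.
    f_t, f_c: prognostic scores computed from baseline covariates only.
    Assumed form (hypothetical, for illustration):
        tau = (mean y_t - mean y_c) - lam * (mean f_t - mean f_c)
    """
    n_t, n_c = len(y_t), len(y_c)
    # Randomization makes the score distribution the same in both arms,
    # so its variance is estimated from the pooled sample.
    var_f = variance(f_t + f_c)
    # Plug-in weight minimizing
    # Var(y_t - lam * f) / n_t + Var(y_c - lam * f) / n_c.
    lam = (covariance(y_t, f_t) / n_t + covariance(y_c, f_c) / n_c) / (
        var_f * (1 / n_t + 1 / n_c)
    )
    tau = (fmean(y_t) - fmean(y_c)) - lam * (fmean(f_t) - fmean(f_c))
    # Standard error from the residuals y - lam * f, treating lam as fixed
    # (a first-order approximation).
    res_t = [y - lam * f for y, f in zip(y_t, f_t)]
    res_c = [y - lam * f for y, f in zip(y_c, f_c)]
    se = (variance(res_t) / n_t + variance(res_c) / n_c) ** 0.5
    return tau, se, lam


# Simulated trial (assumed data-generating process): the outcome equals the
# prognostic score plus noise, and treatment shifts it by a constant effect.
rng = random.Random(0)


def simulate_arm(n, effect):
    f = [rng.gauss(0, 1) for _ in range(n)]
    y = [fi + effect + rng.gauss(0, 0.5) for fi in f]
    return y, f


y_t, f_t = simulate_arm(200, effect=1.0)  # treated arm
y_c, f_c = simulate_arm(100, effect=0.0)  # smaller control arm

tau_hat, se_adj, lam = adjusted_effect(y_t, f_t, y_c, f_c)
se_raw = (variance(y_t) / len(y_t) + variance(y_c) / len(y_c)) ** 0.5

print(f"adjusted effect: {tau_hat:.3f} (SE {se_adj:.3f}), lambda = {lam:.3f}")
print(f"unadjusted SE:   {se_raw:.3f}")
```

Because the scores depend only on baseline covariates, randomization makes the correction term zero in expectation, so the correction changes the variance rather than the target of estimation. In this simulation the scores are strongly prognostic, which is why the adjusted standard error comes out smaller than the unadjusted one; with a weakly prognostic score the gain would shrink.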

Version published to 10.1101/2025.01.15.25320578 on medRxiv
Jan 18, 2025

Related articles

“Double Machine Learning for Causal Inference in High-Dimensional Electronic Health Records”

This article has 5 authors:
1. Mike Du
2. Yuchen Guo
3. Xintong Li
4. Marti Catala
5. Daniel Pareto-Alhambra
This article has no evaluations. Latest version: Jul 22, 2025

Identifiability-Guided Assessment of Digital Twins in Alzheimer’s Disease Clinical Research and Care

This article has 4 authors:
1. Juliet Jiang
2. Jeffrey R. Petrella
3. Wenrui Hao
4. the Alzheimer’s Disease Neuroimaging Initiative
This article has no evaluations. Latest version: Aug 22, 2025

Interventionally-guided representation learning for robust and interpretable AI models in cancer medicine

This article has 5 authors:
1. Dom Kirkham
2. Riccardo Masina
3. Stephen-John Sammut
4. Sach Mukherjee
5. Oscar M. Rueda
This article has no evaluations. Latest version: Jul 21, 2025
