The R.O.A.D. to clinical trial emulation

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Observational studies provide the only evidence on the effectiveness of interventions when randomized controlled trials (RCTs) - apart from the initial RCT that establishes the efficacy of a treatment compared to a placebo - are impractical due to cost, ethical concerns, or time constraints. While many methodologies aim to draw causal inferences from observational data, there is a growing trend to model observational study designs after hypothetical or existing RCTs, a strategy known as “target trial emulation.” Despite its potential, causal inference through target trial emulation is challenging because it cannot fully address the confounding bias inherent in real-world data due to the lack of randomization. In this work, we present a novel framework for target trial emulation that aims to overcome several key limitations, including confounding bias. The framework proceeds as follows: First, we apply the eligibility criteria of a specific trial to an observational cohort derived from real-world data. We then “correct” this cohort by extracting a subset that, through optimization techniques, matches both the distribution of covariates and baseline prognoses (i.e., the prognosis in the trial’s control group) of the target RCT. Next, we address unmeasured confounding by adjusting the prognosis estimates of the treated group to align with those observed in the trial, using cost-sensitive counterfactual models. Following trial emulation, we go a step further by leveraging the emulated cohort to train optimal decision trees, developed by our team, to identify subgroups of patients exhibiting heterogeneity in treatment effects (HTE). The absence of confounding is verified using two external models, and the validity of the treatment effects estimated by our framework is independently confirmed by the team responsible for the original trial we emulate. To our knowledge, this is the first framework to successfully address both observed and unobserved confounding, a challenge that has historically limited the use of randomized trial emulation and causal inference in general since the 1950s. Additionally, our framework holds promise in advancing precision or personalized medicine by identifying patient subgroups that benefit most from specific treatments.

Article activity feed