High-dimensional confounding in causal mediation: a comparison study of double machine learning and regularized partial correlation network

Ming Chen
Tanya T. Nguyen
Jinyuan Liu

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In causal mediation analyses, of interest are the direct or indirect pathways from exposure to an outcome variable. For observation studies, massive baseline characteristics are collected as potential confounders to mitigate selection bias, possibly approaching or exceeding the sample size. Accordingly, flexible machine learning approaches are promising in filtering a subset of relevant confounders, along with estimation using the efficient influence function to avoid overfitting. Among various confounding selection strategies, two attract growing attention. One is the popular debiased, or double machine learning (DML), and another is the penalized partial correlation via fitting a Gaussian graphical network model between the confounders and the response variable. Nonetheless, for causal mediation analyses when encountering high-dimensional confounders, there is a gap in determining the best strategy for confounding selection. Therefore, we exemplify a motivating study on the human microbiome, where the dimensions of mediator and confounders approach or exceed the sample size to compare possible combinations of confounding selection methods. By deriving the multiply robust causal direct and indirect effects across various hypotheses, our comprehensive illustrations offer methodological implications on how the confounding selection impacts the final causal target parameter estimation while generating causality insights in demystifying the “gut-brain axis”. Our results highlighted the practicality and necessity of the discussed methods, which not only guide real-world applications for practitioners but also motivate future advancements for this crucial topic in the era of big data.

Version published to 10.1101/2024.10.12.617110v1 on bioRxiv
Oct 12, 2024

Mendelian Randomization: A Robust Approach for Causal Inference in Observational Data (Motivated by the Trending Study on Cheese Intake and Osteoarthritis by Song Wen et al.)

This article has 7 authors:
1. Ricardo Pietrobon
2. Aline Machiavelli
3. Luiza Paulsen Rodrigues
4. Amit Agrey
5. Lizzy Nkeangnyi
6. Victor Galvão
7. Lucas Teixeira
This article has no evaluationsLatest version Mar 14, 2025
Jointly modelling multiple ancestral populations using GWAS summary data improves causal inference

This article has 8 authors:
1. Gibran Hemani
2. Yoonsu Cho
3. Amanda Chong
4. Tom Palmer
5. Amy Mason
6. John Ferguson
7. David Evans
8. George Davey Smith
This article has no evaluationsLatest version Mar 27, 2025
A Combined Predictive and Causal Approach for Neighborhood-Level Diabetes Detection

This article has 7 authors:
1. Mohammad Noaeen
2. Amirhosein Rostami
3. Ibrahim Ghanem
4. Olli Saarela
5. Karim Keshavjee
6. Jeffrey R. Brook
7. Zahra Shakeri
This article has no evaluationsLatest version Mar 5, 2025

Listed in

Abstract

Article activity feed

Related articles

Mendelian Randomization: A Robust Approach for Causal Inference in Observational Data (Motivated by the Trending Study on Cheese Intake and Osteoarthritis by Song Wen et al.)

Jointly modelling multiple ancestral populations using GWAS summary data improves causal inference

A Combined Predictive and Causal Approach for Neighborhood-Level Diabetes Detection