From GWAS to Causal Inference: A Beginner’s Guide to Mendelian Randomization with Code Examples

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background: Mendelian randomization (MR) is a powerful approach for assessing causal relationships between risk factors and health outcomes using genetic variants as instrumental variables (IVs). The increasing availability of large genome-wide association study (GWAS) summary statistics from resources such as UK Biobank, FinnGen, and other population-based cohorts has made MR analyses more accessible than ever. However, many available guidelines and tutorials remain highly technical, requiring advanced knowledge of statistical genetics and R programming. Objective: This paper aims to provide a clear, step-by-step guide for conducting MR analyses using GWAS summary statistics, designed specifically for non-technical researchers. Methods: We outline a structured workflow covering key stages of MR analysis, including dataset selection, quality control, IVs selection, harmonization, and causal estimation. The workflow integrates online tools for quality control and demonstrates the use of commonly applied R packages such as TwoSampleMR. Each step is illustrated with example code and practical guidance to promote reproducibility. Results and Conclusion: The proposed workflow supports the process of conducting MR analyses, bridging the gap between theoretical guidelines and hands-on implementation. By offering an accessible and reproducible framework, this tutorial aims to help applied researchers, clinicians, and early-career scientists confidently perform MR analyses and interpret causal findings using publicly available GWAS summary data.

Article activity feed