Accounting for Structured Missingness in Canonical Correlation Analysis

Lav Radosavljević
Stephen M. Smith
Thomas E. Nichols

Abstract

A particularly challenging form of missing data is structured missingness, where sets of subjects and variables consistently have missing data. For tabular data from sub-studies or modalities, structured missingness can come from non-participation in follow-up studies, which creates large blocks of missing data. Canonical Correlation Analysis (CCA) is a multivariate modelling tool commonly used to link two different sets of variables, and in neuroimaging it has typically been used to find associations between imaging and non-imaging variables. Motivated by CCA, we propose a new method for covariance estimation from incomplete data that handles data with a mix of structured and unstructured missingness, assuming the data are Missing at Random (MAR). Our proposed method is compared to existing methodology by way of evaluation on simulated data and on real data from subjects in the UK Biobank brain imaging cohort.

Version published to 10.1101/2025.10.09.25337581 on medRxiv
Oct 10, 2025

Related articles

Evaluating Imputation Methods for Handling Missing Data in Complex Survey Designs: Evidence from the India DHS 2017–18

This article has 6 authors:
1. Mahfuzer Rohman
2. Md Sabbir Hossain
3. Md Fakrul Islam
4. Prosenjit Basak Arka
5. Md Rafi Hasan
6. Md Jamal Uddin
This article has no evaluations
Latest version Jan 23, 2026

Missing Data in OHCA Registries: How Multiple Imputation Methods Affect Research Conclusions—Paper II

This article has 4 authors:
1. Stella Jinran Zhan
2. Seyed Ehsan Saffari
3. Marcus Eng Hock Ong
4. Fahad Javaid Siddiqui
This article has no evaluations
Latest version Jan 16, 2026

Missing Data in Intensive Longitudinal Suicide Research: A Monte Carlo simulation study

This article has 1 author:
1. Aleksandr Karnick
This article has no evaluations
Latest version Feb 4, 2026
