Privacy-Preserving Multivariate Bayesian Regression Models for Overcoming Data Sharing Barriers in Health and Genomics

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

We present multivariate Bayesian regression models specifically designed to over-come data-sharing barriers in health and genomics. These multi-response models are well suited for scenarios where data must remain decentralized due to privacy, intellectual property, or regulatory constraints. In extensive simulation studies, our approach consistently outperformed traditional single-response models trained on individual datasets, particularly under real-world conditions such as low signal, unbalanced cohorts, and high-dimensional feature spaces. For the first time, we demonstrate that multivariate Bayesian regression can be implemented using or-thogonal transformations of sufficient statistics, enabling fully privacy-preserving analysis without sharing individual-level data. The models are scalable, inter-pretable, and applicable to predictive tasks across diverse collaborators, supporting secure data-driven research in domains such as clinical trials, biomarker discovery, and precision health.

Article activity feed