cellSTAAR: Incorporating single-cell-sequencing-based functional data to boost power in rare variant association testing of non-coding regions
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Whole genome sequencing (WGS) studies have identified hundreds of millions of rare variants (RVs) and have enabled RV association tests (RVATs) of these variants with complex traits and diseases. Analysis of non-coding variants is challenged by the considerable variability in regulatory function which candidate Cis-Regulatory Elements (cCREs) exhibit across cell types. We propose cellSTAAR, which integrates WGS data with single-cell ATAC-seq data to capture variability in chromatin accessibility across cell types via the construction of cell-type-specific functional annotations and variant sets. To reflect the uncertainty in cCRE-gene linking, cellSTAAR also links cCREs to their target genes using an omnibus framework which aggregates results from a variety of popular linking approaches. We applied cellSTAAR on Freeze 8 (N = 60,000) of the NHLBI Trans-Omics for Precision Medicine (TOPMed) consortium data to four lipids phenotypes: LDL cholesterol, a binary variable corresponding to high LDL cholesterol, HDL cholesterol, and triglycerides. We also provide replication results for all four phenotypes using UK Biobank (N = 190,000). Evidence from simulation studies and our real data analysis demonstrates that cellSTAAR boosts power and improves interpretation of RVATs of cCREs.