Interactions with polygenic background impact quantitative traits in the UK Biobank
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Association studies have linked many genetic variants to a variety of phenotypes but under-standing the biological mechanisms underlying these signals remains a major challenge. Since genes operate within complex networks, statistical interactions between genetic mutations that reflect biological pathways are expected to exist. However, their discovery has been hampered by the vast search space of variant combinations and the multiplicatively small expected effect sizes of interactions. To increase power, we created a test for interaction between single-nucleotide polymorphisms (SNPs) and groups of other variants with a direct effect on a phenotype aggregated in a polygenic score (PGS) which can be performed for any quantitative trait. In realistic simulations, this method avoids false positives and is well powered to find interaction networks. We apply it to 97 quantitative phenotypes in European samples in the UK Biobank and identify 144 independent interactions affecting 52 different traits, including important disease risk variants at genes such as APOE , FTO or TCF7L2 . We develop approaches to refine identified signals and detect 38 pairwise interactions between SNPs. These include known interactions between ABO , FUT2 and TREH affecting alkaline phosphatase levels which are shown to be part of a larger network including PIGC and FUT6 , as well as an interaction for eosinophil levels between IL33 and ALOX15 , two genes whose functional interaction has recently been implicated in asthma. Finally, we propose a method to partition PGSs according to the binding sites of more than 1100 transcription factors using the HOCOMOCO motif database and test for interactions involving functionally partitioned scores. We identify 12 interactions affecting eight traits, two of which directly reflect known regulatory relationships such as that between TCF7L2 (a key regulator of glucose metabolism) and the transcription factor KDM2A , which are known to interact functionally within the Wnt signalling pathway, affecting glycated haemoglobin levels. This work significantly extends the set of known epistatic effects for human phenotypes and shows how statistical interactions can reflect biological interdependencies between genes.