Proteomic and clinical impact of human knockouts in British South Asians

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Human loss-of-function (LoF) variants affecting both copies of a gene (“human knockouts”) provide a unique opportunity to directly study function and clinical impact of genes but are very rare in most populations sequenced to date. Here we study 1,569 British Bangladeshi and -Pakistani adults who were recalled for plasma sampling for proteomic profiling using three distinct technologies (covering >12,000 proteins) from 55k whole exome sequenced Genes & Health adults – a cohort enriched for rare, biallelic (homozygous) variants due to high autozygosity. We identified 199 individuals with rare homozygous predicted LoF genotypes (pLoF) for which the respective cis- protein was measured by at least one technology, and observed extreme (> 3SDs) cis- protein underexpression in 41 individuals (median z-score = −9.72 (range: −19.61 to −4.78) and overexpression in 2 individuals (median z-score = 8.1 (range 4.80 – 11.40)), representing 19% of these variants. For missense homozygotes, we observed 158 individuals with significantly under-expressed cis- protein (median z-score = −6.95 (range: −28.16 to −3.95)) and 62 individuals over-expressed, median z-score = 5.65 (range: 4.57 to 25.08)). The majority (62%) of LoF knockout genes with an identified cis-protein effect had evidence from 2 or more platforms, highlighting the high confidence nature of these discoveries. Systematic clinical assessment of human knockouts with strong evidence of an impact on cis- protein abundance through multi-source electronic health record linkage enabled identification of 1) knockout carriers with rare disease features based on phenotypic similarity, 2) novel rare disease-causing variants, 3) evidence for reclassification of genes and variants of uncertain significance from ClinVar and rare disease panels, and 4) novel gene-phenotype associations in humans. Based on high confidence examples, we developed a machine learning model that predicted 1 in 4 pLOF and 9 in 10 missense variants are likely benign. In summary, our study provides strong human-derived insights into the fundamental biology and clinical relevance of many genes and shows the value of proteogenomic studies of human knockout carriers.

Article activity feed