A unified meta-regression model identifies genes associated with epilepsy

Oscar Aguilar
Mijail Rivas
Manuel A. Rivas

Abstract

Epilepsy is a highly heterogeneous disorder thought to have strong genetic components. However, identifying these risk factors using whole-exome sequencing studies requires very large sample sizes and a good signal-to-noise ratio in order to assess the association between rare variants in any given gene and disease. We present an approach for predicting constraint in the human genome by applying a Hidden Markov Model (HMM) to whole-exome sequencing (WES) data. Using the Regeneron Genetics Center Million Exome dataset and the All of Us whole-genome sequencing data, we predict the probability of observing no variants across the population for each position in the genome. We then combine these predictions with the “rejected substitutions” (RS) score from Genomic Evolutionary Rate Profiling (GERP), pathogenicity predictions from AlphaMissense (AM), and pLoF/missense annotations from Epi25 in a model that detects epilepsy-associated genes. We identify a set of significant (p < 3.4 × 10⁻⁷) genes that did not meet exome-wide significance in previous studies: KCNQ2, SCN2A, STXBP1, CACNA1A, SLC6A1, DYRK1A, KCNB1, SATB1, PCDHAC2, SP4, and RYR2. Our models allow us to jointly evaluate the contributions of constraint, protein-structure-based pathogenicity prediction from AM, and pLoFs. We show that unifying these moderators into a single model both strengthens the evidence for genes with already-known links to epilepsy and identifies new genes with likely links to epilepsy.
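As a rough illustration of the HMM step mentioned in the abstract, the sketch below runs forward-backward smoothing on a two-state HMM over a toy sequence of positions. It is not the authors' model: the two states ("constrained" and "unconstrained"), the binary observation (whether any variant was seen at a position), the function name `state_posteriors`, and all numeric parameters are assumptions made only for this example.

```python
import numpy as np

def state_posteriors(obs, trans, emit_variant, start):
    """Forward-backward posteriors P(state_t = k | all observations).

    obs          : 0/1 per position; 1 means at least one variant was observed
    trans[i, j]  : P(state at t+1 is j | state at t is i)
    emit_variant : P(obs = 1 | state = k) for each state k
    start[k]     : P(state at first position is k)
    All parameter values passed in below are hypothetical.
    """
    obs = np.asarray(obs)
    trans = np.asarray(trans, dtype=float)
    emit_variant = np.asarray(emit_variant, dtype=float)
    start = np.asarray(start, dtype=float)
    n, k = len(obs), len(start)

    # Likelihood of each observation under each state (Bernoulli emissions).
    like = np.where(obs[:, None] == 1, emit_variant[None, :], 1.0 - emit_variant[None, :])

    # Forward pass, rescaled at every step to avoid numerical underflow.
    alpha = np.zeros((n, k))
    a = start * like[0]
    alpha[0] = a / a.sum()
    for t in range(1, n):
        a = (alpha[t - 1] @ trans) * like[t]
        alpha[t] = a / a.sum()

    # Backward pass, also rescaled.
    beta = np.ones((n, k))
    for t in range(n - 2, -1, -1):
        b = trans @ (like[t + 1] * beta[t + 1])
        beta[t] = b / b.sum()

    # Combine and normalise per position to obtain posteriors.
    post = alpha * beta
    return post / post.sum(axis=1, keepdims=True)

# Toy input: state 0 = "constrained" (variants rare), state 1 = "unconstrained".
toy_obs = [0, 0, 0, 0, 1, 1, 0, 1, 1, 1]
post = state_posteriors(
    toy_obs,
    trans=[[0.9, 0.1], [0.1, 0.9]],
    emit_variant=[0.05, 0.6],
    start=[0.5, 0.5],
)
print(np.round(post[:, 0], 3))  # posterior probability of the constrained state
```

Under these assumed parameters, the variant-free run at the start of the toy sequence leans toward the constrained state and the variant-dense tail toward the unconstrained one. A per-position score of this kind is the sort of quantity that could then be combined with other annotations in a downstream gene-level model.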

Version published to 10.1101/2024.06.27.24309590 on medRxiv
Jun 28, 2024

