Leveraging genomic large language models to enhance causal genotype-brain-clinical pathways in Alzheimer’s disease

Qiao Liu
Wanwen Zeng
Hongtu Zhu
Lexin Li
Wing Hung Wong
Alzheimer’s Disease Neuroimaging Initiative

Abstract

Genome-wide association studies (GWAS) have identified numerous Alzheimer’s disease (AD)-associated variants. However, how these variants contribute to the etiology of AD remains largely elusive. Recent advances in genomic large language models (LLMs) offer new opportunities to interpret the genetic variation observed in personal genomes. In this study, we propose epiBrainLLM, a novel computational framework that leverages a genomic LLM to enhance our understanding of the causal pathways from genotypes to brain measures to AD-related clinical phenotypes. epiBrainLLM first converts the personal DNA sequence into a diverse set of genomic and epigenomic features using a pretrained genomic LLM, and then uses these features to predict phenotypes. Across various experimental settings, epiBrainLLM significantly improves causal analysis compared to traditional genotype association approaches. We conclude that epiBrainLLM provides a novel perspective for understanding the regulatory mechanisms underlying AD etiology, potentially offering insights into complex disease mechanisms beyond AD.

Version published to 10.1101/2024.10.03.24314824 on medRxiv
Oct 4, 2024

