HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Genetic Biomarker-Based Medical Diagnosis

Kaiwen Zuo
Zixuan Zhong
Peizhou Huang
Shiyan Tang
Yuyan Chen
Yirui Jiang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The discovery and validation of genetic biomarkers across diverse diseases demand intelligent systems capable of integrating complex multi-omics data with clinical relevance. We introduce HEAL-KGGen, an end-to-end framework that enhances Large Language Models (LLMs) through a hierarchical multi-agent architecture and an automatically constructed medical knowledge graph. The system includes a General Practitioner (GP) agent for initial biomarker triage and specialist agents for genomics, transcriptomics, proteomics, and clinical interpretation. The core innovation of HEAL-KGGen lies in its dynamic knowledge graph pipeline, which combines entity extraction based on patterns and semantics, ontology-aligned normalization (using UMLS, MeSH, SNOMED CT) and the construction of multi-source relationships from biomedical databases and literature. Retrieved subgraphs are transformed into contextual prompts that guide LLM reasoning via structured, explainable pathways. Our experiments show that HEAL-KGGen significantly improves question-answering accuracy across multiple mainstream large language models, with the highest improvement observed on Claude 3.5 Sonnetachieving a 43.75% increase in accuracy., confirming the value of domain-specific graph knowledge in advancing LLM performance for genetic and molecular diagnostics.

Version published to 10.1101/2025.06.03.657521 on bioRxiv
Jun 6, 2025

PRESSnet: a novel framework for patient stratification and biomarker discovery using clinical knowledge graphs

This article has 11 authors:
1. Jake Cohen-Setton
2. Shruti Shikhare
3. Ioannis Kagiampakis
4. Domingo Salazar
5. Miguel Goncalves
6. Elizabeth Coker
7. Sanddhya Jayabalan
8. Damian Bikiel
9. Ben Sidders
10. Etai Jacob
11. Krishna Bulusu
This article has no evaluationsLatest version Dec 15, 2025
Deep Learning Architectures for Multi-Omics Data Integration: Bridging Biomarker Discovery and Clinical Translation

This article has 2 authors:
1. Akshay Krishnan Pushparaj
2. Malarmathi Muthukumar
This article has no evaluationsLatest version Jan 26, 2026
LLMAgent4Bio: LLM Agents for Biological Intelligence Across Genomics, Proteomics, Spatial Biology, and Biomedicine

This article has 9 authors:
1. Sajib Acharjee Dip
2. Dipanwita Mallick
3. Uddip Acharjee Shuvo
4. Shovito Barua Soummo
5. Fazle Rafsani
6. Bikash Kumar Paul
7. Nazifa Ahmed Moumi
8. Shafayat Ahmed
9. Liqing Zhang
This article has no evaluationsLatest version Dec 16, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

PRESSnet: a novel framework for patient stratification and biomarker discovery using clinical knowledge graphs

Deep Learning Architectures for Multi-Omics Data Integration: Bridging Biomarker Discovery and Clinical Translation

LLMAgent4Bio: LLM Agents for Biological Intelligence Across Genomics, Proteomics, Spatial Biology, and Biomedicine