HEAL-KGGen: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Genetic Biomarker-Based Medical Diagnosis
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The discovery and validation of genetic biomarkers across diverse diseases demand intelligent systems capable of integrating complex multi-omics data with clinical relevance. We introduce HEAL-KGGen, an end-to-end framework that enhances Large Language Models (LLMs) through a hierarchical multi-agent architecture and an automatically constructed medical knowledge graph. The system includes a General Practitioner (GP) agent for initial biomarker triage and specialist agents for genomics, transcriptomics, proteomics, and clinical interpretation. The core innovation of HEAL-KGGen lies in its dynamic knowledge graph pipeline, which combines entity extraction based on patterns and semantics, ontology-aligned normalization (using UMLS, MeSH, SNOMED CT) and the construction of multi-source relationships from biomedical databases and literature. Retrieved subgraphs are transformed into contextual prompts that guide LLM reasoning via structured, explainable pathways. Our experiments show that HEAL-KGGen significantly improves question-answering accuracy across multiple mainstream large language models, with the highest improvement observed on Claude 3.5 Sonnetachieving a 43.75% increase in accuracy., confirming the value of domain-specific graph knowledge in advancing LLM performance for genetic and molecular diagnostics.