Identification of Novel Biomarkers in Huntington’s Disease Based on differential Gene Expression Meta-Analysis and Machine Learning Approach
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Huntington's disease (HD) is a severe and progressive neurodegenerative disease for which therapeutic options have so far been confined to symptomatic treatment. Currently, the diagnosis relies on the signs and symptoms shown by patients; however, by that stage, the psychomotor issues have progressed to a point where reversal of the condition is unattainable. Although numerous clinical trials have been actively investigating therapeutic agents aimed at preventing the onset of disease or slowing down the disease progression, there has been a constant need for reliable biomarkers to assess neurodegeneration, monitor disease progression, and assess the efficacy of treatments accurately. Therefore, to discover the key biomarkers associated with the progression of HD, we employed bioinformatics and machine learning (ML) to create a robust pipeline that integrated differentially expressed gene (DEG) analysis with ML to select potential biomarkers. We performed a meta-analysis to identify DEGs using three Gene Expression Omnibus (GEO) microarray datasets from different platforms related to HD-affected brain tissue, applying both relaxed and strict criteria to identify differentially expressed genes. Subsequently, focusing only on genes identified through the inclusive threshold, we employed 19 diverse ML techniques to explore the common genes that contributed to the top three selected ML algorithms and the shared genes that had an impact on the ML algorithms and were observed in the meta-analysis using the stringent condition were selected. Additionally, a receiver operating characteristic (ROC) analysis was conducted on an external dataset to validate the discriminatory power of the identified genes, which led to the selection of GABRD and PHACTR1 as key biomarkers for HD. Our comprehensive methodology, which integrates DEG meta-analysis with ML techniques, enabled a systematic prioritization of these biomarkers, providing valuable insights into their biological significance and potential for further validation in clinical research.