Identification of Biomarkers and Mechanisms for Keloid Disorder based on Comprehensive Bioinformatics Analysis and Machine Learning Algorithms
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background Keloid disorder (KD) is a group of fibroproliferative skin disorders characterized by hypervascularity and excessive accumulation of the extracellular matrix (ECM) and affects individuals of all age groups. The etiology of KD is complex and still poorly understood. This study aimed to investigate biomarkers and therapeutic targets in KD on the basis of comprehensive bioinformatics analysis and machine learning of RNA autosequencing data. Methods Thirteen skin tissues from KD patients (KD samples) and 14 normal control skin tissues (control samples) were collected for RNA sequencing. Initially, differentially expressed key module genes were acquired through expression analysis with weighted gene coexpression network analysis, followed by enrichment analysis. The 10 candidate genes obtained via the CytoHubba plugin were subsequently incorporated into the least absolute shrinkage and selection operator (LASSO) and support vector machine recursive feature elimination (SVM-RFE) to recognize feature genes associated with KD. Furthermore, biomarkers were determined via expression level analysis, followed by enrichment analysis and immunoinfiltration analysis to elucidate the pathogenesis of KD. Results A total of 420 differentially expressed key module genes were identified, and these 420 genes were enriched in collagen- and bone-associated biological functions, including “collagen fibril organization” and “bone development”. With respect to the 10 candidate genes, five feature genes were subsequently obtained through LASSO and SVM-RFE, and among them, NID2, MFAP2, COL8A1, and P4HA3 had significant expression differences between the KD and control samples as well as consistent expression patterns in both datasets; these genes were considered biomarkers. These four biomarkers had excellent abilities to diagnose KD patients, and there were significant positive correlations between these four biomarkers. Functional enrichment analysis suggested that the main enriched KEGG pathways for biomarkers were “steroid hormone biosynthesis”, “cytokine–cytokine receptor interaction”, etc. Furthermore, immune analysis suggested that four biomarkers were negatively linked to type 17 T helper cells and positively linked to 15 immune cells (activated B cells, central memory CD4 T cells, etc.). Conclusion NID2, MFAP2, COL8A1, and P4HA3 were identified as biomarkers for KD, providing more targeted and effective diagnostic and therapeutic strategies for KD.