Predictive Analysis of Methylation Patterns in Oral Squamous Cell Carcinoma (OSCC) Using Machine Learning
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background
Oral and oropharyngeal cancers are the most common types of head and neck cancers, with over 90% originating from squamous cells in the mouth and throat. Chronic tobacco and alcohol use, inflammation, viral infections, betel quid chewing, and genetic predisposition are major risk factors for OSCC, which kills over 100,000 patients annually. Epigenetic mechanisms, such as DNA methylation, can silence tumour suppressor genes, contributing to cancer progression and patient outcomes in oral squamous cell carcinoma (OSCC).
Objective
This study aimed to predict prominent methylation signatures that can distinguish OSCC from normal cells.
Methods
Machine learning algorithms like Support Vector Machine (SVM), Random Forest (RF), and Multilayer Perceptron (MLP) were implemented using R packages and a balanced training dataset consisting of M-values of methylated CpG sites from 46 matched OSCC and normal adjacent tissue samples.
Results
MLP model demonstrated the highest accuracy of 92% on the training dataset and 100% on the blind dataset, even with a reduced feature set of just 10 significantly differentially methylated CpG sites.
Conclusion
A highly accurate and generalizable machine learning model was developed using the Multi-Layer Perceptron with multiple layers (MLP-ml) algorithm, which achieved an accuracy of 95% on an independent validation dataset of 15 OSCC tumors and 7 non-tumor adjacent tissue samples.