Machine learning-based analysis of genomic and transcriptomic data unveils sarcoma clusters with superlative prognostic and predictive value
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Soft tissue sarcomas (STS) histopathological classification system has several conceptual caveats, impacting prognostication and treatment. The clinical and molecular-based tools currently employed to estimate prognosis also have limitations. Clinically driven molecular profiling studies may cover these gaps. We performed DNA sequencing (DNAseq) and RNA sequencing (RNAseq), portraying the molecular profile of 102 samples of 3 of the most common STS subtypes. The RNAseq data was analyzed using unsupervised machine learning models, unravelling previously unknown molecular patterns and identifying 4 well-defined transcriptomic clusters. These transcriptomic clusters have a clear prognostic value, a finding that was externally validated. This transcriptomic cluster-based classification’s prognostic value is superior to the prognostic accuracy of currently used clinical-based (SARCULATOR nomograms) and molecular-based (CINSARC) prognostication tools. The analysis of DNAseq data from the same cohort of samples revealed a plethora of unique and, in some cases, never documented molecular targets for precision treatment across different transcriptomic clusters.