Machine Learning-Enhanced Extraction of Protein Signatures of Renal Cell Carcinoma from Proteomics Data

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

In this study, we generated label-free data-independent acquisition (DIA)-based liquid chromatography (LC)-mass spectrometry (MS) proteomics data from 261 renal cell carcinomas (RCC) and 195 normal adjacent tissues (NAT). The RCC tumors included 48 non-clear cell renal cell carcinomas (non-ccRCC) and 213 ccRCC. A total of 219,740 peptides and 11,943 protein groups were identified with 9,787 protein groups per sample on average. We adopted a comprehensive approach to select representative samples with different mutation sites, considering histopathological, immune, methylation, and non-negative matrix factorization (NMF)-based subtypes, along with clinical characteristics (gender, grade, and stage) to capture the complexity and diversity of ccRCC tumors. We used machine learning identified 55 protein signatures that distinguish RCC tumors from NATs. Furthermore, 39 protein signatures that differentiate different RCC tumor subtypes were also identified. Our findings offer an extensive perspective of the proteomic landscape in RCC, illuminating specific proteins that serve to distinguish RCC tumors from NATs and among various RCC tumor subtypes.

Article activity feed