Intrinsic tumor factors and extrinsic environmental and social exposures contribute to endometrial cancer recurrence patterns
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Purpose In a previous study, we trained, validated and tested models of endometrial cancer (EC) recurrence integrating clinical, genomic and pathological data from the Oncology Research Information Exchange Network (ORIEN). Preliminary studies also have demonstrated that bacterial communities may influence the risk of EC recurrence by altering the local environment within the upper female genital tract. The objective of this study was to evaluate whether extrinsic and environmental factors, including tumor-associated bacterial communities, tumor immune contexture and air pollution alongside clinical, pathologic and genomic features are associated with EC recurrence across clinically relevant risk groups. Patients and Methods: We performed a retrospective, multi-institution, case–control study with data from the ORIEN network EC dataset. Data was stratified into low-risk, FIGO grade 1 and 2, stage I (N = 329), high-risk, or FIGO grade 3 or stages II-IV (N = 324), and non-endometrioid histology (N = 239) groups. RNA and DNA were extracted from tumor specimens and processed to obtain the necessary genomic/metagenomic data. Genus level microbiome data were extracted and curated) from RNA sequencing using Kraken2 , Bracken and exotic software packages. Risk of EC recurrence was evaluated by integrating microbiome and environmental data alongside existing clinical, pathological and genomic data using topic modelling with latent dirichlet allocation (LDA). Prediction models of EC recurrence were created using machine and deep learning analytics (ML and DL) with MATLAB apps and TensorFlow . Finally, performance of both topic and prediction models were externally validated in an independent EC dataset from TCGA. Results The resulting models, analyzed with topic modelling, demonstrated the complexity of factors involved in recurrence of disease for EC. The components of the resulting topic models, and specifically the microbiome, changed when environmental factors, like air pollutants, were introduced in the model. In the low-risk EC group, microbes that were quite abundant in models before introducing environmental factors, were scarcely seen afterwards, like genera Thermothielavioides , Theileria , Rhizoctonia . Bacillus was the genus with higher per-topic probability within all risk groups, especially for low-risk EC (28%). Ozone (O 3 ) was a resulting component of all risk groups’ models. BMI was the sole informative clinical variable after data integration, and only present in the low-risk group. Resulting models from the high-risk and non-endometrioid groups included differential gene expressions: MMP13, S100A7, SMOC1, ACACA and ADD2, DLX5, SLCO2B1, NWD1 respectively. CNVs also were present in both low-risk and non-endometrioid groups, but their per-topic probabilities were low. The same was true for the immune contexture data. The components of the resulting topic models were used to train, validate and test prediction models of EC recurrence by risk groups. Performances of these models were excellent (@ 0.9). Despite some missing microbiome data in TCGA from resulting topic models, prediction models trained in the ORIEN set, had similar performances in TCGA testing set, with overlapping AUC 95% CIs. Conclusion Both extrinsic factors (tumor-associated bacterial communities, tumor immune contexture and air pollution) and intrinsic factors predict EC recurrence. The complexity of tumor and host factors influencing cancer relapses underscore the need for more individualized prediction models of disease outcomes.