A multi-omic, spatial, and whole-slide image dataset of lung neuroendocrine tumours from the lungNENomics cohort

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Lung neuroendocrine tumours (lung NETs) are rare neoplasms comprising approximately 2% of lung cancers. Recent studies have identified distinct molecular groups based on transcriptome and methylome data, but genomic and morphological features remain underexplored due to limited whole-genome and imaging data. We have generated the largest multi-omic dataset of lung NETs to date (201 participants, for a total of n = 294 tumours), including RNA sequencing, EPIC 850K methylation arrays, and whole-genome sequencing. This multiomic dataset also include multi-regional whole-genome sequencing for 41 participants, allowing for the quantification of intra-tumoural heterogeneity. We additionally generated spatial proteomics (64 participants), spatial transcriptomics (4 participants) and whole-slide histopathology images for 212 cases. This dataset enables a comprehensive characterization of lung NET molecular groups and the identification of group-specific morphological features using deep learning algorithms. All quality control analyses, processed data, and scripts are provided to ensure reproducibility. This dataset is available as a basis for further molecular and morphological analysis of lung NETs, and for future research on multi-scale integration.

Article activity feed