TuBA: Tunable biclustering algorithm reveals clinically relevant tumor transcriptional profiles in breast cancer

This article has been Reviewed by the following groups

Read the full article

Abstract

Background

Traditional clustering approaches for gene expression data are not well adapted to address the complexity and heterogeneity of tumors, where small sets of genes may be aberrantly co-expressed in specific subsets of tumors. Biclustering algorithms that perform local clustering on subsets of genes and conditions help address this problem. We propose a graph-based Tunable Biclustering Algorithm (TuBA) based on a novel pairwise proximity measure, examining the relationship of samples at the extremes of genes' expression profiles to identify similarly altered signatures.

Results

TuBA's predictions are consistent in 3,940 breast invasive carcinoma samples from 3 independent sources, using different technologies for measuring gene expression (RNA sequencing and Microarray). More than 60% of biclusters identified independently in each dataset had significant agreement in their gene sets, as well as similar clinical implications. Approximately 50% of biclusters were enriched in the estrogen receptor−negative/HER2-negative (or basal-like) subtype, while >50% were associated with transcriptionally active copy number changes. Biclusters representing gene co-expression patterns in stromal tissue were also identified in tumor specimens.

Conclusions

TuBA offers a simple biclustering method that can identify biologically relevant gene co-expression signatures not captured by traditional unsupervised clustering approaches. It complements biclustering approaches that are designed to identify constant or coherent submatrices in gene expression datasets, and outperforms them in identifying a multitude of altered transcriptional profiles that are associated with observed genomic heterogeneity of diseased states in breast cancer, both within and across tumor subtypes, a promising step in understanding disease heterogeneity, and a necessary first step in individualized therapy.

Article activity feed

  1. Now published in GigaScience doi: 10.1093/gigascience/giz064

    Amartya Singh 1Department of Physics and Astronomy, Rutgers University, Piscataway, NJ, USA2Center for Systems and Computational Biology, Rutgers Cancer Institute, Rutgers University, New Brunswick, NJ, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteGyan Bhanot 1Department of Physics and Astronomy, Rutgers University, Piscataway, NJ, USA2Center for Systems and Computational Biology, Rutgers Cancer Institute, Rutgers University, New Brunswick, NJ, USA3Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteHossein Khiabanian 1Department of Physics and Astronomy, Rutgers University, Piscataway, NJ, USA2Center for Systems and Computational Biology, Rutgers Cancer Institute, Rutgers University, New Brunswick, NJ, USA3Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA4Department of Pathology and Laboratory Medicine, Rutgers Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USAFind this author on Google ScholarFind this author on PubMedSearch for this author on this siteORCID record for Hossein Khiabanian

    A version of this preprint has been published in the Open Access journal GigaScience (see paper https://doi.org/10.1093/gigascience/giz064 ), where the paper and peer reviews are published openly under a CC-BY 4.0 license.

    These peer reviews were as follows:

    Reviewer 1: http://dx.doi.org/10.5524/REVIEW.101761 Reviewer 2: http://dx.doi.org/10.5524/REVIEW.101762 Reviewer 3: http://dx.doi.org/10.5524/REVIEW.101763