Consensus Clustering for Robust Bioinformatics Analysis

Behnam Yousefi
Benno Schwikowski

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Clustering plays an important role in a multitude of bioinformatics applications, including protein function prediction, population genetics, and gene expression analysis. The results of most clustering algorithms are sensitive to variations of the input data, the clustering algorithm and its parameters, and individual datasets. Consensus clustering (CC) is an extension to clustering algorithms that aims to construct a robust result from those clustering features that are invariant under the above sources of variation. As part of CC, stability scores can provide an idea of the degree of reliability of the resulting clustering. This review structures the CC approaches in the literature into three principal types, introduces and illustrates the concept of stability scores, and illustrates the use of CC in applications to simulated and real-world gene expression datasets. Open-source R implementations for each of these CC algorithms are available in the GitHub repository: https://github.com/behnam-yousefi/ConsensusClustering

Version published to 10.1101/2024.03.21.586064 on bioRxiv
Mar 23, 2024

Spectral Clustering for Interdisciplinary Research: From Graph Theory to RNA-seq Data Analysis

This article has 2 authors:
1. Benjamin Couéraud
2. Enikő Regényi
This article has no evaluationsLatest version Dec 11, 2025
Pathway-Centric Global Expression Profiling Reveals Key Molecular Drivers in Hepatocellular Carcinoma

This article has 2 authors:
1. Raghavendra Krishnappa
2. Kanthesh M B
This article has no evaluationsLatest version Jan 8, 2026
Optimizing bioinformatic workflows to extract clinically usable gene expression data from targeted RNA sequencing panels: comparison with total RNAseq

This article has 12 authors:
1. Xiaokang Pan
2. Ashley Patton
3. Yi Seok Chang
4. Ryan Stevens
5. Nehad Mohamed
6. Matthew Hunt
7. Daniel Chappell
8. Yan Hu
9. Cecelia Miller
10. Weiqiang Zhao
11. Matthew Avenarius
12. Dan Jones
This article has no evaluationsLatest version Feb 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Spectral Clustering for Interdisciplinary Research: From Graph Theory to RNA-seq Data Analysis

Pathway-Centric Global Expression Profiling Reveals Key Molecular Drivers in Hepatocellular Carcinoma

Optimizing bioinformatic workflows to extract clinically usable gene expression data from targeted RNA sequencing panels: comparison with total RNAseq