Multiresolution Clustering of Genomic Data

Ali Turfah
Xiaoquan Wen

Abstract

Cluster analysis is a widely used unsupervised learning technique in genomic data analysis, with critical applications such as inferring genetic population structures and annotating cell types from single-cell RNA-seq data. However, most existing clustering methods focus on identifying a single optimal partition while overlooking intrinsic relationships among the inferred clusters. Moreover, clustering results produced by different algorithms often appear inconsistent, and there is a lack of principled approaches to extract shared, biologically meaningful patterns across diverse clustering outputs. In this work, we introduce a computational framework that enables systematic exploration of multi-resolution clustering structures in genomic data, starting from an initial configuration generated by any available clustering algorithm. The proposed algorithm provides a unified and principled approach for uncovering complex, nested biological relationships and reconciling discrepancies among clustering results. We demonstrate the utility of our framework through comprehensive simulations and applications to both genetic and single-cell transcriptomic datasets, highlighting its ability to recover interpretable and reproducible clustering structures. Furthermore, we show that our multi-resolution cluster analysis of complex genomic data yields valuable insights into patterns of human population migration and cell differentiation trajectories.

Version published to 10.1101/2025.06.13.659529 on bioRxiv
Jun 18, 2025

Related articles

Spectral Clustering for Interdisciplinary Research: From Graph Theory to RNA-seq Data Analysis

This article has 2 authors:
1. Benjamin Couéraud
2. Enikő Regényi
This article has no evaluations. Latest version Dec 11, 2025

Self-supervised Graph Contrastive Learning for scRNA-seq Clustering

This article has 1 author:
1. Tong Wu
This article has no evaluations. Latest version Dec 11, 2025

Understanding Pathways in Bioinformatics, Genomics, and Health Applications

This article has 1 author:
1. Diptarup Mallick
This article has no evaluations. Latest version Jan 19, 2026
