CADENCE: Clustering Algorithm - Density-based Exploration and Novelty Clustering with Efficiency

Lexin Chen
Daniel R. Roe
Ramón Alain Miranda-Quintana

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Unsupervised learning techniques play a pivotal role in unraveling protein folding landscapes, constructing Markov State Models, expediting replica exchange simulations, and discerning drug binding patterns, among other applications. A fundamental challenge in current clustering methods lies in how similarities among objects are accessed. Traditional similarity operations are typically only defined over pairs of objects, and this limitation is at the core of many performance issues. The crux of the problem in this field is that efficient algorithms like k -means struggle to distinguish between metastable states effectively. However, more robust methods like density-based clustering demand substantial computational resources. Extended similarity techniques have been proven to swiftly pinpoint high and low-density regions within the data in linear O(N) time. This offers a highly convenient means to explore complex conformational landscapes, enabling focused exploration of rare events or identification of the most representative conformations, such as the medoid of the dataset. In this contribution, we aim to bridge this gap by introducing a novel density clustering algorithm to the Molecular Dynamics Analysis with N -ary Clustering Ensembles (MDANCE) software package based on n -ary similarity framework.

Version published to 10.1101/2025.02.24.639863 on bioRxiv
Feb 28, 2025

Density Peaks Clustering Algorithm Based on Natural Neighbor and Multi-Cluster Merging Strategy

This article has 3 authors:
1. Fang Wan
2. Lili Wei
3. Chao Shi
This article has no evaluationsLatest version Dec 12, 2025
GTcomplex: Spatial indexing-powered search and alignment of macromolecular complexes

This article has 1 author:
1. Mindaugas Margelevicius
This article has no evaluationsLatest version Jan 22, 2026
Reframing Population Genetic Structure as a Quantum Optimization Problem

This article has 1 author:
1. Andrew Davinack
This article has no evaluationsLatest version Dec 24, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Density Peaks Clustering Algorithm Based on Natural Neighbor and Multi-Cluster Merging Strategy

GTcomplex: Spatial indexing-powered search and alignment of macromolecular complexes

Reframing Population Genetic Structure as a Quantum Optimization Problem