A Comprehensive Survey on Clustering Algorithms: Concepts, Taxonomy with Nature-Inspired Meta-Heuristic Approaches and Performance Metrics

Yuvaraj M
Sivaprakash S

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Clustering is a core technique in unsupervised learning that organizes unlabeled data into meaningful groups based on similarity. It has wide applications in domains such as bioinformatics, pattern recognition, social network analysis, computer vision, and artificial intelligence. Owing to the diversity and complexity of real-world datasets, numerous clustering paradigms have been developed, each with specific advantages and limitations. This survey provides a structured overview of classical and advanced clustering approaches, including hierarchical, partition-based, density-based, model-based, subspace, grid-based, and search-based metaheuristic techniques. We further examine commonly used similarity measures and validation metrics, including internal and external evaluation criteria, to highlight their role in assessing clustering quality. A comparative taxonomy is presented to clarify algorithmic characteristics, scalability, robustness, and parameter sensitivity under varying data conditions. Despite significant progress, challenges remain in handling noisy and high-dimensional data, determining the optimal number of clusters, and ensuring computational efficiency. Emerging directions such as hybrid frameworks, self-supervised learning, and multi-view clustering offer promising avenues for developing more adaptive and scalable clustering solutions.

Version published to 10.21203/rs.3.rs-8771280/v1 on Research Square
Mar 6, 2026

Deep Clustering via Gradual Community Detection

This article has 5 authors:
1. Tianyu Cheng
2. Na Chen
3. Siyi Yang
4. Chuyi Fan
5. Qun Chen
This article has no evaluationsLatest version Mar 24, 2026
SE-GCL: Structure-Aware Graph Clustering with Entropy Minimization

This article has 7 authors:
1. Guoteng Xu
2. Xiudian Zhang
3. Lingjie Wang
4. Jianjiang Liu
5. Hanlin Tang
6. Chengjiang Li
7. Zhi Ouyang
This article has no evaluationsLatest version Mar 24, 2026
A Dynamic Cost-Deviation Greedy Algorithm for Large-Scale Assignment Problems

This article has 3 authors:
1. Yash Kumar
2. Ramesh Chandra Sahoo
3. Prashant Dixit
This article has no evaluationsLatest version Mar 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Deep Clustering via Gradual Community Detection

SE-GCL: Structure-Aware Graph Clustering with Entropy Minimization

A Dynamic Cost-Deviation Greedy Algorithm for Large-Scale Assignment Problems