RapCluster: Bridging the Reproducibility Gap in Clustering Analysis
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Clustering is ubiquitous across science, yet a text-mining audit of 736,399 open-access articles identified as using clustering (2000-2025) reveals common practice leaves key parameters undocumented or untuned, contributing to the reproducibility crisis in science. We developed an interactive web platform featuring 11 widely adopted clustering algorithms to enable transparent clustering analysis and reporting, aligning practical use with best practices in computational research.
Code availability
The browser-based clustering analysis platform RapCluster is available for download at https://github.com/lutfia95/RapCluster under MIT License and accessible at https://rappsilberlab.org/rapcluster/ The web version is capacity-limited to 8 GB of memory (ca. 12,000 candidates with 130 features). All source codes used for the analysis are included with the publication.