Identifying Unique Patient Groups in Melasma Using Clustering: A Retrospective Observational Study with Machine Learning Implications for Targeted Therapies
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Melasma management is challenged by heterogeneity in patient presentation, particularly among individuals with darker skin tones. This study applied k-means clustering, an unsupervised machine learning algorithm that partitions data into k distinct clusters based on feature similarity, to identify patient subgroups that could provide a hypothesis-generating framework for future precision strategies. We analysed clinical and demographic data from 150 South African women with melasma using k-means clustering. The optimal number of clusters was determined using the Elbow Method and Bayesian Information Criterion (BIC), with t-distributed stochastic neighbour embedding (t-SNE) visualization for assessment. The k-Means algorithm identified seven exploratory patient clusters explaining 52.6% of the data variability (R2 = 0.526), with model evaluation metrics including BIC = 951.630 indicating optimal model fit and a Silhouette Score of 0.200 suggesting limited separation between clusters consistent with overlapping clinical phenotypes, while the Calinski-Harabasz index of 26.422 confirmed relatively well-defined clusters that were characterized by distinct profiles including “The Moderately Sun Exposed Young Women”, “Elderly Women with Long-Term Melasma”, and “Younger Women with Severe Melasma”, with key differentiators being age distribution and menopausal status, melasma severity and duration patterns, sun exposure behaviours, and quality of life impact profiles that collectively define the unique clinical characteristics of each subgroup. This study demonstrates how machine learning can identify clinically relevant patient subgroups in melasma. Aligning interventions with the characteristics of specific clusters can potentially improve treatment efficacy.