Robust Dual Space Factorization for Unsupervised Feature Selection (RDSF-UFS)
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Unsupervised feature selection is critical in high-dimensional data analysis, as it identifies the most informative features without relying on label information. In this paper, we propose a novel Robust Dual Space Factorization for Unsupervised Feature Selection (RDSF-UFS) framework based on Symmetric Nonnegative Matrix Factorization (SNMF) and Nonnegative Matrix Tri-Factorization (NMTF). The proposed method explores sample and feature affinity matrices to capture intrinsic data structures in a dual latent space. Specifically, we introduce a robust objective function incorporating the Frobenius and í µí°¿ 2,1 norms to enhance resistance to noise and outliers. We further impose an orthogonality constraint on the latent factor matrices to promote sparsity and improve clustering performance. An efficient optimization algorithm is derived to iteratively update the latent factors by solving the associated gradient-based equations. We evaluated the performance of RDSFUFS on eight benchmark datasets, including biological microarray, face image, speech signal, and digit image datasets. Experimental results demonstrate that RDSFUFS outperforms state-of-the-art feature selection methods in terms of Normalized Mutual Information (NMI) and Accuracy (ACC), highlighting its ability to uncover meaningful feature patterns and improve clustering performance.