Evaluating Dimensionality Reduction Techniques for Liver Disease Classification Using Unlabeled Data

Muhammad Abubakar

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Liver disease poses a significant health burden worldwide, necessitating early and accurate diagnosis to improve patient outcomes. However, real-world clinical datasets often contain high-dimensional and unlabeled features, complicating traditional classification approaches. This study investigates the effectiveness of dimensionality reduction techniques in enhancing liver disease classification performance using unsupervised data. We explore and compare principal component analysis (PCA), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP) on a medical dataset containing physiological and biochemical attributes. Each technique is evaluated based on its ability to preserve the intrinsic data structure while improving downstream clustering and classification accuracy. The findings reveal that nonlinear methods, such as t-SNE and UMAP, offer superior separability of liver disease indicators in reduced-dimensional space compared to linear approaches, like PCA. This work highlights the potential of combining unsupervised learning and dimensionality reduction for efficient feature extraction in medical diagnostics, especially where labeled data are limited or unavailable.

Version published to 10.31219/osf.io/qbjhx_v1 on OSF Preprints
Jul 2, 2025

An enhanced explainable thyroid disease diagnosis by leveraging cluster-smote and machine learning models

This article has 4 authors:
1. Usman Suleh
2. Badamasi Alhaji Ahmed
3. Farouk Lawan Gambo
4. Fatima Umar Zambuk
This article has no evaluationsLatest version Jan 27, 2026
Smart Diagnosis: AI and ML Powered Breast Cancer Classification

This article has 2 authors:
1. Sagar Verma
2. Vaibhav Sabale
This article has no evaluationsLatest version Jan 28, 2026
Comparing Algorithm Effectiveness in Health Data Analysis

This article has 1 author:
1. Abdulmalik Hazaa Alshammari
This article has no evaluationsLatest version Jan 22, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

An enhanced explainable thyroid disease diagnosis by leveraging cluster-smote and machine learning models

Smart Diagnosis: AI and ML Powered Breast Cancer Classification

Comparing Algorithm Effectiveness in Health Data Analysis