Comparative Study of Machine Learning Techniques for Diabetes Forecasting

Abdul Aamir Khan
Bk Sharma


Abstract

The rising global prevalence of diabetes has intensified the need for accurate, early prediction. This study reviews research that applies machine learning (ML) approaches to clinical data for diabetes prediction. Common pre-processing steps include encoding categorical data, handling missing values, and normalization. To enhance model performance, the reviewed studies employ feature selection and dimensionality reduction techniques such as Principal Component Analysis (PCA). Performance metrics such as accuracy, precision, recall, F1-score, and AUC-ROC are used to evaluate and compare supervised learning algorithms, including Random Forest, Support Vector Machines (SVM), k-Nearest Neighbors (k-NN), Logistic Regression, and Decision Trees. Many studies report high accuracy but rely on small datasets, which limits generalizability. This study underscores the need for diverse datasets and clinically interpretable models, and highlights gaps in model interpretability and validation practices.
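As an illustration of the evaluation metrics named in the abstract, the following is a minimal sketch in plain Python, not the method of any reviewed study. The function names (`confusion_counts`, `classification_metrics`, `auc_roc`), the convention that label 1 means "diabetic", the 0.5 decision threshold, and the toy labels and scores are all assumptions made for the example.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels; 1 = diabetic is an assumed convention."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F1; undefined ratios fall back to 0.0."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def auc_roc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC-ROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC-ROC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): true labels and model scores for eight patients.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
auc = auc_roc(y_true, scores)
print(metrics, auc)
```

The pairwise AUC computation is quadratic in the number of samples, which is fine for the small datasets the abstract describes; a rank-based formulation would be a better fit for larger ones. With only eight samples, a single changed prediction shifts accuracy by 12.5 percentage points, which is one concrete way to see why high accuracy on small datasets generalizes poorly.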

Version published to 10.21203/rs.3.rs-7145782/v1 on Research Square
Jul 22, 2025

Related articles

Diabetes Prediction Through Machine Learning and Ontology

This article has 5 authors:
1. Vishal A. Wankhede
2. Anant R. More
3. Pankaj S. Desai
4. Nilesh R. Thakre
5. Swati M. Bachhav
This article has no evaluations
Latest version Mar 26, 2026

An Intelligent AI-Driven Framework for Early Prediction of Heart Disease Using Advanced Machine Learning Techniques

This article has 2 authors:
1. Akshata K
2. Dharshini K
This article has no evaluations
Latest version Apr 7, 2026

A Machine Learning–Driven Health Risk Index for Predicting Chronic Disease Burden

This article has 1 author:
1. Ved Sharma
This article has no evaluations
Latest version Apr 2, 2026