Adaptive Synthetic Minority Oversampling Technique with Density-Guided Noise Injection and Local Density Adaptation
Abstract
Class imbalance remains a persistent challenge in supervised learning, often leading to biased classifiers and poor detection of minority instances. This paper introduces Adaptive Synthetic Minority Oversampling Technique with Guided Density (AdaptiveSMOTEGD), a novel method that integrates local density-based sparsity detection, tunable Gaussian noise injection, and domain-specific constraint preservation. Unlike conventional methods such as Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling Approach (ADASYN), Borderline-SMOTE, Synthetic Minority Over-sampling Technique for Nominal and Continuous features (SMOTENC), Support Vector Machine SMOTE (SVMSMOTE), and KMeans-SMOTE, the proposed approach selectively targets sparse minority regions while avoiding degradation in dense areas. It also supports datasets with purely numerical features as well as those containing both numerical and categorical attributes. Experimental evaluation on eight numerical-only and six mixed-type benchmark datasets using Light Gradient Boosting Machine (LightGBM) demonstrates that AdaptiveSMOTEGD consistently achieves competitive or superior performance in F1-score, recall, Matthews Correlation Coefficient (MCC), and area under the precision-recall curve (AUC-PR), particularly under highly imbalanced and noisy conditions. Statistical analysis confirms significant improvements in recall for both numerical-only and mixed datasets, establishing AdaptiveSMOTEGD as a robust, scalable, and versatile solution for real-world imbalanced classification problems.
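The core idea described above — oversampling that favors sparse minority regions and injects noise scaled by local density — can be illustrated with a minimal sketch. This is not the paper's exact algorithm; the function name, the mean-kNN-distance sparsity measure, and the `noise_scale` parameter are illustrative assumptions, and constraint preservation and categorical handling are omitted.

```python
import numpy as np

def density_guided_oversample(X_min, n_new, k=5, noise_scale=0.1, rng=None):
    """Illustrative sketch: SMOTE-style interpolation whose seed selection is
    weighted toward sparse minority regions, with Gaussian noise scaled by
    each seed's local sparsity (mean distance to its k nearest minority
    neighbours). Not the authors' exact AdaptiveSMOTEGD implementation."""
    rng = np.random.default_rng(rng)
    n = len(X_min)
    # Pairwise distances among minority samples; ignore self-distances.
    d = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)
    nn_idx = np.argsort(d, axis=1)[:, :k]                      # k nearest minority neighbours
    sparsity = np.take_along_axis(d, nn_idx, 1).mean(axis=1)   # local sparsity score
    w = sparsity / sparsity.sum()                              # sparser points drawn more often
    seeds = rng.choice(n, size=n_new, p=w)
    mates = nn_idx[seeds, rng.integers(0, k, size=n_new)]
    lam = rng.random((n_new, 1))
    # SMOTE-style interpolation between each seed and a random neighbour.
    X_syn = X_min[seeds] + lam * (X_min[mates] - X_min[seeds])
    # Gaussian noise injection, scaled by the seed's local sparsity.
    X_syn += rng.normal(0.0, noise_scale, X_syn.shape) * sparsity[seeds, None]
    return X_syn
```

Weighting seed selection by local sparsity is what distinguishes this scheme from vanilla SMOTE, which samples seeds uniformly and so can over-densify already dense minority clusters.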