Fraud Detection in Online Transactions: Toward Hybrid Supervised–Unsupervised Learning Pipelines

Shuo Xu
Yuchen Cao
Zhongyan Wang
Yexin Tian

Abstract

Fraud detection in online transactions is challenging because fraudulent events are rare and fraud strategies keep evolving. This study compares three supervised machine learning models (Logistic Regression, Random Forest, and LightGBM) for detecting fraudulent transactions in an extremely imbalanced dataset. We evaluate each model with both standardized and raw features, using macro-averaged metrics and AUC. Our findings show that ensemble-based models, particularly LightGBM, significantly outperform the linear baseline and are robust to feature scaling. We also assess K-Means clustering as an unsupervised baseline and observe that it fails to meaningfully separate fraud cases, which suggests the need for more informative features or hybrid learning approaches. These results offer practical insights into model selection, preprocessing, and the trade-offs between precision and recall in real-world fraud detection systems.
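To illustrate why macro-averaged metrics and AUC are natural choices for an extremely imbalanced problem, the following is a minimal sketch in plain Python. It is not the authors' evaluation code. The function names (macro_metrics, roc_auc) and the toy labels and scores are hypothetical, chosen only for illustration; a real pipeline would more likely use a library such as scikit-learn.

```python
def macro_metrics(y_true, y_pred, labels=(0, 1)):
    """Return macro-averaged (precision, recall, F1): the unweighted mean over classes."""
    precisions, recalls, f1s = [], [], []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        # Undefined ratios (no predictions or no true cases) are treated as 0.0.
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half a win. This pairwise form is O(P * N), fine for a toy example.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 98 legitimate transactions (0) and 2 fraudulent ones (1).
y_true = [0] * 98 + [1] * 2
# A degenerate "model" that labels every transaction as legitimate.
y_pred = [0] * 100

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
precision, recall, f1 = macro_metrics(y_true, y_pred)
print("accuracy:", accuracy)
print("macro precision, recall, F1:", precision, recall, f1)
```

On this toy data the degenerate model reaches high accuracy while its macro recall is only 0.5, because the fraud class contributes a recall of zero and macro averaging weights both classes equally. AUC complements this by scoring the ranking of predicted probabilities rather than a single decision threshold.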

Version published to 10.20944/preprints202505.1101.v1
May 14, 2025

Related articles

Effective Credit Card Fraud Detection Using Data Mining Techniques
Author: Jing Xian Ooi
No evaluations. Latest version: May 9, 2025

Online Banking Fraud Detection Model: Decentralized Machine Learning Framework to Enhance Effectiveness and Compliance with Data Privacy Regulations
Authors: Hisham AbouGrad, Lakshmi Sankuru
No evaluations. Latest version: May 6, 2025

Towards a Feed-Forward Neural Network for Financial Fraud Detection
Author: Oyindamola Ogunruku
No evaluations. Latest version: May 6, 2025
