Harnessing Exploratory Data Analysis for Robust Financial Fraud Detection and Model Enhancement

Mark Lokanan

Abstract

This paper explores the critical role of Exploratory Data Analysis (EDA) in detecting fraud and ensuring robust machine learning model performance. Applying both univariate and multivariate EDA techniques, graphical and non-graphical, uncovers key trends and relationships within the dataset. The analysis reveals significant variability in the financial data associated with fraud cases and, in particular, highlights the increased scale of fraud in the early 2000s. The EDA process facilitates the identification of outliers, correlations, and potential data quality issues such as missing values and inconsistencies. EDA also informs the data transformations and feature engineering steps that ultimately improve the performance of the machine learning models. Models built with the Random Forest and Classification and Regression Trees (CART) algorithms demonstrate strong classification accuracy and generalize effectively to new data. The findings underscore the importance of EDA in the data modeling process, particularly in fraud detection, where understanding underlying patterns and relationships is essential for developing reliable predictive models.
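
The non-graphical EDA checks named in the abstract (missing values, outliers, correlations) can be illustrated with a minimal Python sketch. This is not the paper's code: the function names, the 1.5 × IQR outlier rule, and the toy data are all assumptions made for illustration, and the Random Forest and CART modeling step is left out.

```python
import math
import statistics


def missing_count(values):
    """Count entries recorded as None (a stand-in for missing values)."""
    return sum(v is None for v in values)


def iqr_outliers(values, k=1.5):
    """Return observed values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule)."""
    observed = [v for v in values if v is not None]
    q1, _, q3 = statistics.quantiles(observed, n=4)
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in observed if v < low or v > high]


def pearson(xs, ys):
    """Pearson correlation over the pairs where both values are observed."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    mean_x = statistics.fmean([x for x, _ in pairs])
    mean_y = statistics.fmean([y for _, y in pairs])
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x, _ in pairs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for _, y in pairs))
    return cov / (norm_x * norm_y)


# Hypothetical toy data, invented for illustration only.
fraud_amount = [10.0, 12.0, 11.0, None, 13.0, 9.0, 250.0, 12.5]
years_active = [1.0, 2.0, 1.5, 3.0, 2.5, 1.0, 8.0, 2.0]

print("missing values:", missing_count(fraud_amount))
print("IQR outliers:", iqr_outliers(fraud_amount))
print("correlation:", round(pearson(fraud_amount, years_active), 3))
```

On a real dataset these checks would typically be run per column before any transformation or feature engineering, so that flagged values can be inspected rather than silently dropped.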

Version published to 10.21203/rs.3.rs-5635767/v1 on Research Square
Dec 16, 2024

Related articles

Mining Financial Data for Fraud Detection using Ensemble Learning and Outlier Detection

This article has 2 authors:
1. Manimegalai R
2. Vijayalaskhmi P
This article has no evaluations
Latest version Dec 10, 2025

Predictive Analysis of Bank Marketing Data for Customer Response

This article has 2 authors:
1. Yash Mishra
2. Kedarnath senapati
This article has no evaluations
Latest version Jan 18, 2026

Construction and analysis of data model for financial market volatility prediction based on support vector machine

This article has 1 author:
1. XiaoMeng Su
This article has no evaluations
Latest version Jan 21, 2026
