Improving Ensemble Models for Software Defect Prediction: a study applying preprocessing techniques

Bianca P. R. Vieira
Rogério E. Garcia

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Defect prediction in software is a practice to improve the quality of software. However, the methods proposed to detect defects efficiently have challenges. Methods based on mining software repositories face challenges like the high dimensionality of data sets and the imbalance of datasets from software repositories. The need to deal with imbalanced data scenarios and large feature sets motivates the search to improve defect prediction models' effectiveness. Related works have studied ensemble models, feature selection, and imbalanced data, but have not analyzed their individual and combined impact with real-world datasets. The general purpose is to enhance the mining of software repositories to detect defects. We collected data from three open-source repositories, preprocessed using feature selection and data balance techniques, and developed models to compare with the same model algorithms but without preprocessing. The results are promising, showing improvement on final general metrics, as well as the metrics for the minority class. All the code developed in this research is available in the GitHub repository SoftDefectProcess

Version published to 10.21203/rs.3.rs-7483430/v1 on Research Square
Sep 19, 2025

Improving effort-aware defect prediction using machine learning methods

This article has 3 authors:
1. Alireza Mahdibarzi
2. Amirfarhad Farhadi
3. Azadeh Zamanifar
This article has no evaluationsLatest version Sep 9, 2025
Actionable Insights from Developer Behavior: A Practical Approach to Software Defect Prediction

This article has 2 authors:
1. Carlos Andres Ramirez Catano
2. Makoto Itoh
This article has no evaluationsLatest version Sep 8, 2025
Refactoring in Software Maintenance and Development: Application with Case Study

This article has 3 authors:
1. Rahmon Ariyo Badru
2. Akorede Mojeed Shittu
3. Idowu Olugbenga Adewumi
This article has no evaluationsLatest version Sep 9, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Improving effort-aware defect prediction using machine learning methods

Actionable Insights from Developer Behavior: A Practical Approach to Software Defect Prediction

Refactoring in Software Maintenance and Development: Application with Case Study