Within-Project and Cross-Project Defect Prediction Based on Model Averaging

Tong Li
Zhong Wang
Peibei Shi


Abstract

Software defect prediction has an important impact on the national economy and the financial services industry, and discovering defective modules early in software development is of great significance. This paper proposes a within-project and cross-project defect prediction technique based on model averaging, which uses the XGBoost and LightGBM machine learning algorithms as candidate models and introduces model averaging theory to improve performance. First, the two candidate models are used for probability prediction, with each group of the data used in turn as a test dataset to evaluate the models by cross-validation. Then, the model weights are determined by minimizing the sum of the squared prediction errors over all groups, and finally the model-averaged predicted probability is obtained. Four typical public software defect datasets (NASA, AEEEM, ReLink, SoftLab) are used as test datasets, and four indicators (precision, recall, F1, and AUC) are used as evaluation criteria. For within-project defect prediction, the model averaging method gives slightly better results on the four datasets than XGBoost and LightGBM alone, which is consistent with the ensemble learning idea behind model averaging theory. Compared with six traditional machine learning algorithms, the model averaging method performs best on most of the data. For cross-project defect prediction, the model averaging method performs better overall than four benchmark methods. The experimental results show that the model averaging prediction method achieves good prediction results in both within-project and cross-project defect prediction scenarios.
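
The weighting step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the two weights are non-negative and sum to one (the abstract does not state the constraint), it uses a closed-form solution that only applies to the two-model case, and the labels and out-of-fold probabilities below are hypothetical values standing in for cross-validated XGBoost and LightGBM outputs.

```python
def averaging_weight(y, p1, p2):
    """Return the weight w on model 1 (1 - w goes to model 2) that minimises
    sum((y - (w * p1 + (1 - w) * p2)) ** 2) subject to 0 <= w <= 1.

    Assumption: weights are non-negative and sum to one.
    """
    # Setting the derivative of the squared error to zero gives
    # w = sum((y - p2) * (p1 - p2)) / sum((p1 - p2) ** 2).
    num = sum((yi - b) * (a - b) for yi, a, b in zip(y, p1, p2))
    den = sum((a - b) ** 2 for a, b in zip(p1, p2))
    if den == 0.0:
        return 0.5  # identical predictions: every weight gives the same error
    # The objective is a convex quadratic in w, so clipping the unconstrained
    # minimiser to [0, 1] yields the constrained minimiser.
    return min(1.0, max(0.0, num / den))


def average_probabilities(w, p1, p2):
    """Combine two probability vectors with weights w and 1 - w."""
    return [w * a + (1.0 - w) * b for a, b in zip(p1, p2)]


# Hypothetical defect labels and out-of-fold predicted probabilities.
y = [1, 0, 1, 0, 1, 0]
p_xgb = [0.9, 0.2, 0.6, 0.4, 0.7, 0.1]  # stand-in for XGBoost
p_lgb = [0.8, 0.3, 0.7, 0.2, 0.5, 0.3]  # stand-in for LightGBM

w = averaging_weight(y, p_xgb, p_lgb)
p_avg = average_probabilities(w, p_xgb, p_lgb)
print("weight on first model:", round(w, 4))
print("averaged probabilities:", [round(p, 4) for p in p_avg])
```

Because a weight of 0 or 1 reproduces either candidate model exactly, the averaged prediction can never have a larger squared error than the better of the two candidates on the data used to fit the weight. With more than two candidate models, a constrained quadratic programming solver would be needed instead of this closed form.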

Version published to 10.21203/rs.3.rs-4734176/v1 on Research Square
Aug 7, 2024

Related articles

Machine Learning Based Approach for Software Defect Prediction using Hyperparameter

This article has 2 authors:
1. Digvijay Narayan Sharma
2. Dilip Kumar Yadav
This article has no evaluations. Latest version Jul 22, 2024

Towards Applicable Just-In-Time Defect Prediction: A Practical Perspective on Effort Awareness

This article has 2 authors:
1. Peter Bludau
2. Alexander Pretschner
This article has no evaluations. Latest version Aug 5, 2024

Gini calculation and rule performance interpretation

This article has 1 author:
1. Meenal Badki
This article has no evaluations. Latest version Jul 30, 2024
