Predicting software defects: a comprehensive analysis of machine learning approaches

Masoud Sistaninezhad
Saman Rajebi
Shahrzad Pouramirarsalani
Sajjad Pakzad
Houshyar Asadi
Siamak Pedrammehr

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In software development, achieving flawless software is essential for maintaining quality and reducing testing costs. Predicting software defects is a crucial aspect of enhancing software quality. This paper explores various techniques, including feature selection, principal component analysis, and fisher discriminant ratio, utilizing well-known machine learning algorithms on the publicly available JM1 dataset, addressing the gap in the current literature. support vector machine, multi-layer perceptron, K-nearest neighbor, Naïve Bayes, and decision tree algorithms are utilized along with the K-Fold approach for class label classification. Additionally, a binary genetic algorithm with a support vector machine classifier is employed for feature selection, and a particle swarm optimization algorithm is used to determine optimal fisher discriminant ratio coefficients. Model performance is evaluated according to accuracy, sensitivity, specificity, F-measure, precision, and a confusion matrix. The findings indicate that all machine learning models perform well with different processing techniques. However, the support vector machine algorithm, when combined with optimal fisher discriminant ratio coefficients, achieved the highest accuracy at 88.2% and excelled in specificity at 99.6%. The K-nearest neighbor classifier with selected features attained the highest scores in precision, sensitivity, and F-measure. Other classification algorithms did not surpass these models in any performance metrics.

Version published to 10.21203/rs.3.rs-5006431/v1 on Research Square
Oct 7, 2024

Enhancing malware detection reliability in non-executable files using confidence score prediction

This article has 4 authors:
1. Rasoul Rezvani-Jalal
2. Morteza Zakeri
3. Saeed Parsa
4. Amin Hasan-Zarei
This article has no evaluationsLatest version May 15, 2025
Investigating the Role of Feature Variation and Data Transformations of Different Types of Machine Learning Algorithms in Classifying Benign - Malignant Breast Cancer

This article has 3 authors:
1. Anak Agung Ngurah Gunawana
2. Putu Astri Novianti
3. Anak Agung Ngurah Frady Cakra Negara
This article has no evaluationsLatest version May 5, 2025
AI-Powered Defect Prediction: From Code Smells to Failure Forecasting

This article has 5 authors:
1. Md Mostafizur Rahman
2. Md Mostafijur Rahman
3. Maria Khatun Shuvra
4. Md Mashfiquer Rahman
5. Najmul Gony
This article has no evaluationsLatest version Jun 9, 2025

Listed in

Abstract

Article activity feed

Related articles

Enhancing malware detection reliability in non-executable files using confidence score prediction

Investigating the Role of Feature Variation and Data Transformations of Different Types of Machine Learning Algorithms in Classifying Benign - Malignant Breast Cancer

AI-Powered Defect Prediction: From Code Smells to Failure Forecasting