Towards Applicable Just-In-Time Defect Prediction: A Practical Perspective on Effort Awareness

Peter Bludau
Alexander Pretschner

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Reviewing software changes is crucial, as it mitigates the introduction of defects and thereby saves time and reduces costs. Just-in-time (JIT) defect prediction has emerged as an approach to support the review process by predicting the likelihood of defects in new commits. Effort-aware evaluations were proposed to better manage developers' limited time and to analyze the applicability of JIT defect prediction approaches. However, current effort-aware approaches neglect the time-dependent nature of software engineering when evaluating the performance and rank commits to find most defective commits with limited effort. This reveals a gap between the evaluation of models in research and their application in practice, where a timely decision for every single commit is needed. Thus, evaluating models based on effort-aware evaluations does not reflect the actual performance in real-world scenarios and may misrepresent their applicability. In order to evaluate if state-of-the-art JIT defect prediction approaches are applicable, we address these limitations and suggest a more practical evaluation strategy that shifts the focus to conserving effort while ensuring that defective commits are still examined. We further elaborate on the possibility to tune the model prediction and allow for a more reliable application of JIT defect prediction models. Conducting a case study of six open-source projects, we find that the additional performance metrics allow new insights into the potential of JIT defect prediction and state-of-the-art defect prediction can be applicable under the right assumptions. Taking this new perspective, while still reviewing 95% of defective commits, on average, 46% of the non-defective commits are correctly identified by JIT defect prediction models and can be skipped. Thus, while maintaining high recall levels, 20% of the total avoidable effort can be saved employing JIT defect prediction models.

Version published to 10.21203/rs.3.rs-4716570/v1 on Research Square
Aug 5, 2024

AI-Powered Defect Prediction: From Code Smells to Failure Forecasting

This article has 5 authors:
1. Md Mostafizur Rahman
2. Md Mostafijur Rahman
3. Maria Khatun Shuvra
4. Md Mashfiquer Rahman
5. Najmul Gony
This article has no evaluationsLatest version Jun 9, 2025
The Role of Explainable AI in Automated Software Testing: Opportunities and Challenges

This article has 2 authors:
1. Md. Badiuzzaman Biplob
2. Abdur Rahman
This article has no evaluationsLatest version Jul 1, 2025
Software Engineering Challenges in the Deployment of Generative AI Models at Scale

This article has 1 author:
1. Devisharan Mishra
This article has no evaluationsLatest version Jun 17, 2025

Listed in

Abstract

Article activity feed

Related articles

AI-Powered Defect Prediction: From Code Smells to Failure Forecasting

The Role of Explainable AI in Automated Software Testing: Opportunities and Challenges

Software Engineering Challenges in the Deployment of Generative AI Models at Scale