Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears

Andrés Carrera
Marco E. Benalcázar

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In credit risk, scoring models based on logistic regression have been developed to optimize the default risk assessment. However, these models require complex feature engineering, and their accuracy worsens as the arrears progresses. This study proposes the use of machine learning techniques (XGBoost and artificial neural networks) to generate scores in different arrears segments (No Arrears Segment, 1–30 Days of Arrears Segment, 31–90 Days of Arrears Segment, and All Segments). The Kolmogorov–Smirnov (KS) metric is used to assess the efficiency and predictive power of the models. To ensure the accuracy and reliability of the models, a five-step methodology is employed. It starts with the formulation of the problem, followed by the selection of a data sample and definition of the target variable, then a descriptive analysis of the data is performed to facilitate the data cleaning. Subsequently, the models are trained and tested, and finally, the results are analyzed, and the models obtained are interpreted. The results show that both XGBoost and artificial neural network models outperform logistic regression in most of the arrears segments. In the No Arrears Segment, the XGBoost model is the best with KS = 63.36%. In the 1–30 Segment, XGBoost is also the best with KS = 51.38%. In the 31–90 Segment, the artificial neural network model is the best with KS = 38.77%. Finally, with all segments of arrears, the XGBoost model is again the best with KS = 74.05%.

Version published to 10.3390/jrfm18110630
Nov 10, 2025
Version published to 10.20944/preprints202508.0385.v1
Aug 6, 2025

Applying Multiple Linear Regression to Enhance Short-Term Stock Forecasting Accuracy

This article has 2 authors:
1. TOUSIF AL RASHID
2. Raj Kumar
This article has no evaluationsLatest version Dec 15, 2025
A Unified Machine Learning Framework for Enterprise Portfolio Forecasting, Risk Detection, and Automated Reporting

This article has 1 author:
1. Ashutosh Agarwal
This article has no evaluationsLatest version Dec 10, 2025
Machine Learning to Detect Abnormal Delivery Performance in Supply Chain Operations

This article has 1 author:
1. Gita Ziabari
This article has no evaluationsLatest version Dec 19, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Applying Multiple Linear Regression to Enhance Short-Term Stock Forecasting Accuracy

A Unified Machine Learning Framework for Enterprise Portfolio Forecasting, Risk Detection, and Automated Reporting

Machine Learning to Detect Abnormal Delivery Performance in Supply Chain Operations