Track: Undergraduate Student Paper Competition
Abstract
In the last decades, private debt collection has become an important industry in Ecuador. Debt collection agencies are organizations that manage the process of collecting money from delinquent debts and their performance is often measured by the collection success rate. However, determining which clients will repay their liabilities is a complex process and many times subjectively judged. Machine learning (ML) models have been successfully implemented on the financing sector for various applications , however a limited amount of work has been published in the prediction of a client´s debt payment probability. In this study, ML models are trained to predict a client´s payment probability to an Ecuadorian Debt Collection Agency the first three month after signing a payment agreement. Specifically, a feedforward neural network, logistic regression, and gradient boosting ensemble models are implemented using the SEMMA data mining methodology, which comprises of the steps of sample, explore, modify, model, and assess. Furthermore, by analyzing the results of the models, relevant features that determine whether a costumer will pay or not its debt are identified. The results show that neural networks perform better than the competing models in terms of classification accuracy.