Forecasting Credit Card Attrition using Machine Learning Models

In recent years, credit card attrition has emerged as an issue of significant concern for the banking sector. It has a significant impact on profitability, given that the cost of acquiring new customers is higher than that of retaining existing customers. In this work, a selection of supervised Machine Learning models to identify which customers want to cancel their credit cards is evaluated. The banking industry uses this technology to obtain more reliable predictions when identifying opportunities for purchase, investment, or fraud. These models can be adapted independently, by recognizing patterns and algorithms based on mathematical calculations. Four models (LightGBM, XGBoost, Random Forest and Logistic Regression) were evaluated to predict, using data about customers and products held pertaining to a bank in Colombia, the likelihood of customers canceling their credit cards. By analyzing the ROC curves using the AUC metric, it is concluded that, of the selected models, the model chosen for deployment would be LightGBM, since it was the one that performed best in the experiments conducted. Furthermore, the “Score Acierta” variable, a customer rating provided by the Colombian credit rating agency, was found to be the most discriminating in prediction models.

[1]  Javier Jesús Espinosa Zúñiga Aplicación de algoritmos Random Forest y XGBoost en una base de solicitudes de tarjetas de crédito , 2020 .

[2]  A. Athanassopoulos Customer Satisfaction Cues To Support Market Segmentation and Explain Switching Behavior , 2000 .

[3]  Eric W. T. Ngai,et al.  Customer churn prediction using improved balanced random forests , 2009, Expert Syst. Appl..

[4]  V. Mittal,et al.  Satisfaction, Repurchase Intent, and Repurchase Behavior: Investigating the Moderating Effect of Customer Characteristics , 2001 .

[5]  Horacio Fernández Castaño,et al.  Modelo logístico: una herramienta estadística para evaluar el riesgo de crédito , 2005 .

[6]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[7]  John Ruskin,et al.  Satisfaction , 2005, BMJ : British Medical Journal.

[8]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[9]  Dirk Van den Poel,et al.  Customer attrition analysis for financial services using proportional hazard models , 2004, Eur. J. Oper. Res..

[10]  Heeseok Lee,et al.  Customer Churn Prediction Based on Feature Clustering and Nonparallel Support Vector Machine , 2014, Int. J. Inf. Technol. Decis. Mak..

[11]  Rüdiger Wirth,et al.  CRISP-DM: Towards a Standard Process Model for Data Mining , 2000 .

[12]  M. Rezac,et al.  How to Measure the Quality of Credit Scoring Models , 2011 .

[13]  Yong Shi,et al.  Prediction of Customer Attrition of Commercial Banks based on SVM Model , 2014, ITQM.

[14]  Geoffrey Randall Principles of Marketing , 1993 .

[15]  K. Chitra,et al.  Customer Retention in Banking Sector using Predictive Data Mining Technique , 2011 .

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[17]  Ajay K. Manrai,et al.  A field study of customers’ switching behavior for bank services ☆ , 2007 .

[18]  Andrew Marshall,et al.  Assessing the impact of derived behavior information on customer attrition in the financial service industry , 2014, Eur. J. Oper. Res..