Consumer Credit Risk Models Via Machine-Learning Algorithms

We apply machine-learning techniques to construct nonlinear nonparametric forecasting models of consumer credit risk. By combining customer transactions and credit bureau data from January 2005 to April 2009 for a sample of a major commercial bank's customers, we are able to construct out-of-sample forecasts that significantly improve the classification rates of credit-card-holder delinquencies and defaults, with linear regression R2's of forecasted/realized delinquencies of 85%. Using conservative assumptions for the costs and benefits of cutting credit lines based on machine-learning forecasts, we estimate the cost savings to range from 6% to 25% of total losses. Moreover, the time-series patterns of estimated delinquency rates from this model over the course of the recent financial crisis suggest that aggregated consumer credit-risk analytics may have important applications in forecasting systemic risk.

[1]  G. G. Ide,et al.  Debit or Credit , 1919, The Psychological clinic.

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[4]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[5]  David J. Hand,et al.  Statistical Classification Methods in Consumer Credit Scoring: a Review , 1997 .

[6]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[7]  Arnoud Boot Relationship Banking: What Do We Know? , 2000 .

[8]  J. Galindo,et al.  Credit Risk Assessment Using Statistical and Machine Learning: Basic Methodology and Risk Modeling Applications , 2000 .

[9]  Dean P. Foster,et al.  Variable Selection in Data Mining: Building a Predictive Model for Bankruptcy , 2001 .

[10]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[11]  Amir F. Atiya,et al.  Bankruptcy prediction for credit risk using neural networks: A survey and new results , 2001, IEEE Trans. Neural Networks.

[12]  Paul S. Calem,et al.  An overview of consumer data and credit reporting , 2003 .

[13]  C. Goose,et al.  Glossary of Terms , 2004, Machine Learning.

[14]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[15]  R. Avery,et al.  Consumer Credit Scoring: Do Situational Circumstances Matter? , 2004 .

[16]  Ron Kohavi,et al.  Guest Editors' Introduction: On Applied Research in Machine Learning , 1998, Machine Learning.

[17]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[18]  Young-Chan Lee,et al.  Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters , 2005, Expert Syst. Appl..

[19]  C ONG,et al.  Building credit scoring models using genetic programming , 2005, Expert Syst. Appl..

[20]  Roger M. Stein The relationship between default prediction and lending profits: Integrating ROC analysis and loan pricing , 2005 .

[21]  Douglas W. Dwyer,et al.  Inferring the default rate in a population by comparing two incomplete default databases , 2006 .

[22]  Sheng-Tun Li,et al.  The evaluation of consumer loans using support vector machines , 2006, Expert Syst. Appl..

[23]  William W. Lang,et al.  Competitive Effects of Basel II on U . S . Bank Credit Card Lending , 2006 .

[24]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[25]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[26]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[27]  Anthony Saunders,et al.  The Economics of Credit Cards, Debit Cards and Atms: A Survey and Some New Evidence , 2006 .

[28]  D. Storey,et al.  Are good or bad borrowers discouraged from applying for loans? Evidence from US small business credit markets , 2009 .

[29]  L. Thomas Introduction to consumer credit and credit scoring , 2009 .

[30]  Jonathan Crook,et al.  Support vector machines for credit scoring and discovery of significant features , 2009, Expert Syst. Appl..

[31]  C. Perignon,et al.  The Level and Quality of Value-at-Risk Disclosure by Commercial Banks , 2009 .

[32]  Sumit Agarwal,et al.  Benefits of Relationship Banking: Evidence from Consumer Credit Markets , 2009, Journal of Monetary Economics.

[33]  A. Savvopoulos Consumer Credit Models: Pricing, Profit and Portfolios , 2010 .

[34]  N. Valev,et al.  The role of household and business credit in banking crises , 2010 .

[35]  Mathias Drehmann,et al.  The integrated impact of credit and interest rate risk on banks: A dynamic framework and stress testing application , 2010 .

[36]  Kylie Smith,et al.  Price incentives and consumer payment behaviour , 2010 .

[37]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..