Enhanced decision support in credit scoring using Bayesian binary quantile regression

Fierce competition and the recent financial crisis have increased the importance of credit scoring in the financial and banking industries. An accurate estimate of credit risk helps organizations decide whether to grant credit to potential customers. Many classification methods have been proposed in the literature to handle this problem. This paper proposes a model for evaluating credit risk based on binary quantile regression with Bayesian estimation. It points out three distinct advantages of this approach: (i) the method provides accurate predictions of which customers may default in the future, (ii) it provides detailed insight into the effects of the explanatory variables on the probability of default, and (iii) it is well suited to building a segmentation scheme of customers in terms of risk of default and the corresponding uncertainty about the prediction. A frequently studied dataset from a German bank is used to show the applicability of the proposed method. The results demonstrate that the methodology can be an important tool for credit companies that want to take the credit risk of their customers fully into account.
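
As a rough illustration of how a binary quantile regression model can turn explanatory variables into a default probability, the sketch below assumes the common latent-variable formulation y* = x'beta + eps, y = 1 if y* > 0, with an asymmetric Laplace distribution (location 0, scale 1) as the working error distribution at quantile tau. The abstract does not spell out the model, so this formulation, the function names (`ald_cdf`, `prob_default`, `log_likelihood`), and the toy inputs are all assumptions made for illustration, not the paper's implementation. A Bayesian estimator would combine such a likelihood with a prior on beta and sample the posterior, for instance by MCMC.

```python
import math


def ald_cdf(u, tau):
    """CDF of the asymmetric Laplace distribution with location 0, scale 1.

    Its tau-th quantile is 0, i.e. ald_cdf(0, tau) == tau.
    """
    if u <= 0:
        return tau * math.exp((1.0 - tau) * u)
    return 1.0 - (1.0 - tau) * math.exp(-tau * u)


def prob_default(x, beta, tau):
    """P(y = 1 | x) = P(eps > -x'beta) under the assumed latent model."""
    index = sum(xi * bi for xi, bi in zip(x, beta))
    return 1.0 - ald_cdf(-index, tau)


def log_likelihood(X, y, beta, tau, eps=1e-12):
    """Bernoulli log-likelihood of binary outcomes y given rows X (illustrative)."""
    total = 0.0
    for x, yi in zip(X, y):
        # Clamp to avoid log(0) if the probability underflows numerically.
        p = min(max(prob_default(x, beta, tau), eps), 1.0 - eps)
        total += math.log(p) if yi == 1 else math.log(1.0 - p)
    return total


# Hypothetical toy data: an intercept plus one standardized covariate.
X = [[1.0, -0.5], [1.0, 0.2], [1.0, 1.3]]
y = [0, 0, 1]
beta = [-0.4, 0.9]  # hypothetical coefficients, not estimates from the paper

for tau in (0.25, 0.5, 0.75):
    probs = [round(prob_default(x, beta, tau), 3) for x in X]
    print(tau, probs, round(log_likelihood(X, y, beta, tau), 3))
```

Evaluating the same customers at several values of tau, as in the loop above, is one way to see how the predicted probability and its sensitivity to the covariates vary across quantiles. With a zero linear index, `prob_default` returns 1 - tau by construction.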
