Improving credit scoring by differentiating defaulter behaviour

We present a methodology for improving credit scoring models by distinguishing two forms of rational behaviour of loan defaulters. It is common knowledge among practitioners that there are two types of defaulters, those who do not pay because of cash flow problems (‘Can’t Pay’), and those that do not pay because of lack of willingness to pay (‘Won’t Pay’). This work proposes to differentiate them using a game theory model that describes their behaviour. This separation of behaviours is represented by a set of constraints that form part of a semi-supervised constrained clustering algorithm, constructing a new target variable summarizing relevant future information. Within this approach the results of several supervised models are benchmarked, in which the models deliver the probability of belonging to one of these three new classes (good payers, ‘Can’t Pays’, and ‘Won’t Pays’). The process improves classification accuracy significantly, and delivers strong insights regarding the behaviour of defaulters.

[1]  A. Rapoport,et al.  Discount rates inferred from decisions: an experimental study , 1987 .

[2]  Edward J. Janger,et al.  The Myth of the Rational Borrower: Behaviorism, Rationality and the Misguided Reform of Bankruptcy Law , 2005 .

[3]  Aldo Rustichini,et al.  Cognitive skills affect economic preferences, strategic behavior, and job attachment , 2009, Proceedings of the National Academy of Sciences.

[4]  H. Wette Collateral in Credit Rationing in Markets with Imperfect Information: Note , 1983 .

[5]  Paola Sapienza,et al.  The Determinants of Attitudes towards Strategic Default on Mortgages , 2011 .

[6]  Jonathan N. Crook,et al.  Credit Scoring and Its Applications , 2002, SIAM monographs on mathematical modeling and computation.

[7]  Paola Sapienza,et al.  The Determinants of Attitudes towards Strategic Default on Mortgages , 2011 .

[8]  Aldo Rustichini,et al.  Cognitive Skills Explain Economic Preferences, Strategic Behavior, and Job Attachment , 2008, SSRN Electronic Journal.

[9]  Richard Weber,et al.  Granting and managing loans for micro-entrepreneurs: New developments and practical experiences , 2013, Eur. J. Oper. Res..

[10]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[11]  Cuneyt Guzelis,et al.  Gradient Networks for Clustering , 2006 .

[12]  DA de Waal An Investigation into the Use of Generalized Additive Neural Networks in Credit Scoring , 2008 .

[13]  L. Thomas A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers , 2000 .

[14]  Yi Wang,et al.  Combining data mining and Game Theory in manufacturing strategy analysis , 2007, J. Intell. Manuf..

[15]  Esa Jokivuolle,et al.  A Model for Estimating Recovery Rates and Collateral Haircuts for Bank Loans , 2000 .

[16]  Peter A. Beling,et al.  A study in the combination of two consumer credit scores , 2001, J. Oper. Res. Soc..

[17]  J. Stiglitz,et al.  Credit Rationing in Markets with Imperfect Information , 1981 .

[18]  Angela K. Littwin,et al.  Beyond Usury: A Study of Credit Card Use and Preference Among Low-Income Consumers , 2007 .

[19]  Bart Baesens,et al.  Neural network survival analysis for personal loan data , 2005, J. Oper. Res. Soc..

[20]  P. G. Moffatt,et al.  Hurdle models of loan default , 2005, J. Oper. Res. Soc..

[21]  Jake Ansell,et al.  Predicting default of a small business using different definitions of financial distress , 2012, J. Oper. Res. Soc..

[22]  Jonathan F. Bard,et al.  Large-scale constrained clustering for rationalizing pickup and delivery operations , 2009 .

[23]  Toshinao Yoshiba,et al.  Analytical Solution for Expected Loss of a Collateralized Loan: A Square-root Intensity Process Negatively Correlated with Collateral Value , 2013 .

[24]  G. P. Patil,et al.  Spatially constrained clustering and upper level set scan hotspot detection in surveillance geoinformatics , 2006, Environmental and Ecological Statistics.

[25]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[26]  J. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research , 2015, Eur. J. Oper. Res..

[27]  Richard Weber,et al.  Online phishing classification using adversarial data mining and signaling games , 2010, SKDD.

[28]  Guoqiang Peter Zhang,et al.  Avoiding Pitfalls in Neural Network Research , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[29]  David Laibson,et al.  Individual laboratory-measured discount rates predict field behavior , 2008, Journal of risk and uncertainty.

[30]  Ian Davidson,et al.  Constrained Clustering: Advances in Algorithms, Theory, and Applications , 2008 .

[31]  Richard Weber,et al.  Semi-supervised Constrained Clustering with Cluster Outlier Filtering , 2011, CIARP.

[32]  Naeem Siddiqi,et al.  Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring , 2005 .

[33]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[34]  Vishal Vatsa,et al.  A Game-Theoretic Approach to Credit Card Fraud Detection , 2005, ICISS.

[35]  L. Green,et al.  Discounting of Delayed Rewards: A Life-Span Comparison , 1994 .

[36]  Stephan Meier,et al.  Overborrowing and undersaving: lessons and policy implications from research in behavioral economics , 2007 .

[37]  S. S. Ravi,et al.  Clustering with Constraints: Feasibility Issues and the k-Means Algorithm , 2005, SDM.

[38]  Christian Gollier,et al.  Debt Contract, Strategic Default, and Optimal Penalties with Judgement Errors * , 2004 .

[39]  U. Ben-Zion,et al.  Discount rates inferred from decisions: an experimental study , 1989 .

[40]  Raymond J. Mooney,et al.  A probabilistic framework for semi-supervised clustering , 2004, KDD.

[41]  Mark B. Sandler,et al.  Structural Segmentation of Musical Audio by Constrained Clustering , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[42]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .