Credit scoring in banks and financial institutions via data mining techniques: A literature review

This paper presents a comprehensive review of the studies conducted in the application of data mining techniques focus on credit scoring from 2000 to 2012. Yet, there isn‟t adequate literature reviews in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct online journal database. The studies are categorized and classified into enterprise, individual and small and midsized (SME) companies credit scoring. Data mining techniques are also categorized to single classifier, Hybrid methods and Ensembles. Variable selection methods are also investigated separately because there is a major issue in a credit scoring problem. The findings of this literature review reveals that data mining techniques are mostly applied to an individual credit score and there is inadequate research on enterprise and SME credit scoring. Also ensemble methods, support vector machines and neural network methods are the most favorite techniques used recently. Hybrid methods are investigated in four categories and two of the frequently used combinations are “classification and classification” and “clustering and classification”. This review of literature analysis provides scope for future research and concludes with some helpful suggestions for further research.

[1]  Hussein A. Abdou,et al.  Neural nets versus conventional techniques in credit scoring in Egyptian banking , 2008, Expert Syst. Appl..

[2]  Chih-Chou Chiu,et al.  Credit scoring using the hybrid neural discriminant technique , 2002, Expert Syst. Appl..

[3]  Arijit Laha Building contextual classifiers by integrating fuzzy rule based classification technique and k-nn method for credit scoring , 2007, Adv. Eng. Informatics.

[4]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[5]  Mu-Chen Chen,et al.  Credit scoring and rejected instances reassigning through evolutionary computation techniques , 2003, Expert Syst. Appl..

[6]  David J. Hand,et al.  Statistical Classification Methods in Consumer Credit Scoring: a Review , 1997 .

[7]  Tetsuo Tomiyama,et al.  Advanced Engineering Informatics , 2007, Adv. Eng. Informatics.

[8]  Lin Ma,et al.  Mining the customer credit using hybrid support vector machine technique , 2009, Expert Syst. Appl..

[9]  Steven Finlay,et al.  Are we modelling the right thing? The impact of incorrect problem specification in credit scoring , 2009, Expert Syst. Appl..

[10]  A Ben David RULE EFFECTIVENESS IN RULE-BASED SYSTEMS: A CREDIT SCORING CASE STUDY , 2008 .

[11]  Witold Pedrycz,et al.  Data Mining Methods for Knowledge Discovery , 1998, IEEE Trans. Neural Networks.

[12]  Jure Zupan,et al.  Consumer Credit Scoring Models with Limited Data , 2007, Expert Syst. Appl..

[13]  Nan-Chen Hsieh,et al.  Hybrid mining approach in the design of credit scoring models , 2005, Expert Syst. Appl..

[14]  Chih-Fong Tsai,et al.  Using neural network ensembles for bankruptcy prediction and credit scoring , 2008, Expert Syst. Appl..

[15]  Eibe Frank,et al.  Accuracy of machine learning models versus "hand crafted" expert systems - A credit scoring case study , 2009, Expert Syst. Appl..

[16]  Bart Baesens,et al.  Credit Risk Management , 2008 .

[17]  Hewijin Christine Jiau,et al.  Evaluation of neural networks and data mining methods on a credit assessment task for class imbalance problem , 2006 .

[18]  Jian Ma,et al.  Rough set and scatter search metaheuristic based feature selection for credit scoring , 2012, Expert Syst. Appl..

[19]  L. Thomas A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers , 2000 .

[20]  Christophe Mues,et al.  An experimental comparison of classification algorithms for imbalanced credit scoring data sets , 2012, Expert Syst. Appl..

[21]  Kin Keung Lai,et al.  Least squares support vector machines ensemble models for credit scoring , 2010, Expert Syst. Appl..

[22]  Stephen C. H. Leung,et al.  Vertical bagging decision trees model for credit scoring , 2010, Expert Syst. Appl..

[23]  Jian Ma,et al.  Two credit scoring models based on dual strategy ensemble trees , 2012, Knowl. Based Syst..

[24]  Ping Yao,et al.  Neighborhood rough set and SVM based hybrid credit scoring classifier , 2011, Expert Syst. Appl..

[25]  Jonathan N. Crook,et al.  Credit Scoring and Its Applications , 2002, SIAM monographs on mathematical modeling and computation.

[26]  Manoj Kumar Tiwari,et al.  Computational time reduction for credit scoring: An integrated approach based on support vector machine and stratified sampling method , 2012, Expert Syst. Appl..

[27]  Bart Baesens,et al.  Inferring descriptive and approximate fuzzy rules for credit scoring using evolutionary algorithms , 2007, Eur. J. Oper. Res..

[28]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[29]  Bee Wah Yap,et al.  Using data mining to improve assessment of credit worthiness via credit scoring models , 2011, Expert Syst. Appl..

[30]  L. Thomas Consumer credit models: pricing, profit and portfolios , 2009 .

[31]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[32]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[33]  Chun-Ling Chuang,et al.  Constructing a reassigning credit scoring model , 2009, Expert Syst. Appl..

[34]  Sheng-Tun Li,et al.  The evaluation of consumer loans using support vector machines , 2006, Expert Syst. Appl..

[35]  Loris Nanni,et al.  An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring , 2009, Expert Syst. Appl..

[36]  C ONG,et al.  Building credit scoring models using genetic programming , 2005, Expert Syst. Appl..

[37]  Yingxu Yang,et al.  Adaptive credit scoring with kernel learning methods , 2007, Eur. J. Oper. Res..

[38]  Tian-Shyug Lee,et al.  A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines , 2005, Expert Syst. Appl..

[39]  Chunguang Zhou,et al.  Credit scoring algorithm based on link analysis ranking with support vector machine , 2009, Expert Syst. Appl..

[40]  Lun-Ping Hung,et al.  A data driven ensemble classifier for credit scoring analysis , 2009, Expert Syst. Appl..

[41]  Shouyang Wang,et al.  Rough set and Tabu search based feature selection for credit scoring , 2010, ICCS.

[42]  Bernadette Kamleitner,et al.  Consumer credit use: a process model and literature review , 2007 .

[43]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[44]  Gianluca Antonini,et al.  Subagging for credit scoring models , 2010, Eur. J. Oper. Res..

[45]  Kevin Leung,et al.  A comparison of variable selection techniques for credit scoring , 2008 .

[46]  Feng-Chia Li,et al.  Combination of feature selection approaches with SVM in credit scoring , 2010, Expert Syst. Appl..

[47]  Hussein A. Abdou Genetic programming for credit scoring: The case of Egyptian public sector banks , 2009, Expert Syst. Appl..

[48]  Jih-Jeng Huang,et al.  Two-stage genetic programming (2SGP) for the credit scoring model , 2006, Appl. Math. Comput..

[49]  K. K. Jain,et al.  Neural network credit scoring model for micro enterprise financing in India , 2011 .

[50]  Ji Won Kim,et al.  Decision tree-based technology credit scoring for start-up firms: Korean case , 2012, Expert Syst. Appl..

[51]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[52]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[53]  David West,et al.  Neural network ensemble strategies for financial decision applications , 2005, Comput. Oper. Res..

[54]  Bor-Wen Cheng,et al.  Prediction model building with clustering-launched classification and support vector machines in credit scoring , 2009, Expert Syst. Appl..

[55]  Jonathan Crook,et al.  Support vector machines for credit scoring and discovery of significant features , 2009, Expert Syst. Appl..

[56]  Chih-Fong Tsai,et al.  Credit rating by hybrid machine learning techniques , 2010, Appl. Soft Comput..

[57]  Francisco Louzada,et al.  Poly-bagging predictors for classification modelling for credit scoring , 2011, Expert Syst. Appl..

[58]  Hussein A. Abdou,et al.  Credit Scoring, Statistical Techniques and Evaluation Criteria: A Review of the Literature , 2011, Intell. Syst. Account. Finance Manag..

[59]  Sven F. Crone,et al.  Instance sampling in credit scoring: An empirical study of sample size and balancing , 2012 .