Credit scoring using data mining techniques with particular reference to Sudanese banks

One of the key success factors of lending organizations in general and banks in particular is the assessment of borrower credit worthiness in advance during the credit evaluation process. Credit scoring models have been applied by many researchers to improve the process of assessing credit worthiness by differentiating between prospective loans on the basis of the likelihood of repayment. Thus, credit scoring is a very typical Data Mining (DM) classification problem. Many traditional statistical and modern computational intelligence techniques have been presented in the literature to tackle this problem. The main objective of this paper is to describe an experiment of building suitable Credit Scoring Models (CSMs) for the Sudanese banks. Two commonly discussed data mining classification techniques are chosen in this paper namely: Decision Tree (DT) and Artificial Neural Networks (ANN). In addition Genetic Algorithms (GA) and Principal Component Analysis (PCA) are also applied as feature selection techniques. In addition to a Sudanese credit dataset, German credit dataset is also used to evaluate these techniques. The results reveal that ANN models outperform DT models in most cases. Using GA as a feature selection is more effective than PCA technique. The highest accuracy of German data set (80.67%) and Sudanese credit scoring models (69.74%) are achieved by a hybrid GA-ANN model. Although DT and its hybrid models (PCA-DT, GA-DT) are outperformed by ANN and its hybrid models (PCA-ANN, GA-ANN) in most cases, they produced interpretable loan granting decisions.

[1]  Hussein A. Abdou,et al.  Neural nets versus conventional techniques in credit scoring in Egyptian banking , 2008, Expert Syst. Appl..

[2]  Nnamdi I. Nwulu,et al.  Credit Scoring Using Soft Computing Schemes: A Comparison between Support Vector Machines and Artificial Neural Networks , 2011, DEIS.

[3]  Wei Xu,et al.  A Support Vector Machine Based Method for Credit Risk Assessment , 2010, 2010 IEEE 7th International Conference on E-Business Engineering.

[4]  Andres M. Gonzalez,et al.  Evolutionary Algorithms for Selecting the Architecture of a MLP Neural Network: A Credit Scoring Case , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[5]  Bo Tang,et al.  A new credit scoring method based on improved fuzzy support vector machine , 2012, 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE).

[6]  Defu Zhang,et al.  A Decision Tree Scoring Model Based on Genetic Algorithm and K-Means Algorithm , 2008, 2008 Third International Conference on Convergence and Hybrid Information Technology.

[7]  Adnan Khashman,et al.  Neural networks for credit risk evaluation: Investigation of different neural models and learning schemes , 2010, Expert Syst. Appl..

[8]  Leilei Zhang,et al.  Application of Support Vector Machines Method in Credit Scoring , 2009, ISNN.

[9]  Aroop Mukherjee Credit Scoring Model Using Data Mining Techniques–a pragmatic approach , 2010 .

[10]  Amir Arzy Soltan,et al.  A hybrid model using decision tree and neural network for credit scoring problem , 2012 .

[11]  Yi Jiang,et al.  A new approach based on a rough set and a decision tree to bank customer credit evaluation , 2008, 2008 IEEE International Symposium on IT in Medicine and Education.

[12]  C ONG,et al.  Building credit scoring models using genetic programming , 2005, Expert Syst. Appl..

[13]  Bhekisipho Twala,et al.  Multiple classifier application to credit risk assessment , 2010, Expert Syst. Appl..

[14]  Ming-chang Lee,et al.  Enterprise Credit Risk Evaluation models: A Review of Current Research Trends , 2012 .

[15]  Soner Akkoç,et al.  An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish credit card data , 2012, Eur. J. Oper. Res..

[16]  Huwaida Tagelsir Elshoush,et al.  Alert correlation in collaborative intelligent intrusion detection systems - A survey , 2011, Appl. Soft Comput..

[17]  Raja Noor Ainon,et al.  Enhancement of transparency and accuracy of credit scoring models through genetic fuzzy classifier , 2010 .

[18]  Stjepan Oreski,et al.  Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment , 2012, Expert Syst. Appl..

[19]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[20]  José Salvador Sánchez,et al.  A literature review on the application of evolutionary computing to credit scoring , 2013, J. Oper. Res. Soc..

[21]  Jonathan Crook,et al.  Credit Scoring Models in the Credit Union Environment Using Neural Networks and Genetic Algorithms , 1997 .

[22]  Ahmad Ghodselahi,et al.  Application of Artificial Intelligence Techniques for Credit Risk Evaluation , 2011 .

[23]  Jonathan Crook,et al.  Support vector machines for credit scoring and discovery of significant features , 2009, Expert Syst. Appl..

[24]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[25]  Yap Bee Wah,et al.  Using data mining predictive models to classify credit card applicants , 2010, 2010 6th International Conference on Advanced Information Management and Service (IMS).

[26]  Rinkle Rani Aggarwal,et al.  Comparative Evaluation of Predictive Modeling Techniques on Credit Card Data , 2011 .

[27]  Rafael Pino-Mejías,et al.  Credit scoring models for the microfinance industry using neural networks: Evidence from Peru , 2013, Expert Syst. Appl..

[28]  Hian Chye Koh,et al.  A Two-step Method to Construct Credit Scoring Models with Data Mining Techniques , 2006 .

[29]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[30]  Jure Zupan,et al.  Consumer Credit Scoring Models with Limited Data , 2007, Expert Syst. Appl..

[31]  Ahmet Burak Emel,et al.  A credit scoring approach for the commercial banking sector , 2003 .

[32]  AbdouHussein,et al.  Neural nets versus conventional techniques in credit scoring in Egyptian banking , 2008 .

[33]  Bee Wah Yap,et al.  Using data mining to improve assessment of credit worthiness via credit scoring models , 2011, Expert Syst. Appl..

[34]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[35]  I-Cheng Yeh,et al.  The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients , 2009, Expert Syst. Appl..

[36]  Hussein A. Abdou,et al.  Credit Scoring, Statistical Techniques and Evaluation Criteria: A Review of the Literature , 2011, Intell. Syst. Account. Finance Manag..

[37]  Wei Li,et al.  An Empirical Study on Credit Scoring Model for Credit Card by Using Data Mining Technology , 2011, 2011 Seventh International Conference on Computational Intelligence and Security.

[38]  Padraig Cunningham,et al.  Explaining the output of ensembles in medical decision support on a case by case basis , 2003, Artif. Intell. Medicine.

[39]  Ying Wah Teh,et al.  Credit Scoring Models Using Soft Computing Methods: A Survey , 2010, Int. Arab J. Inf. Technol..

[40]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[41]  Bart BaesensRudy Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation , 2003 .

[42]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[43]  Bart Baesens,et al.  Rule Extraction from Support Vector Machines: An Overview of Issues and Application in Credit Scoring , 2008, Rule Extraction from Support Vector Machines.

[44]  Hussein A. Abdou Genetic programming for credit scoring: The case of Egyptian public sector banks , 2009, Expert Syst. Appl..