Credit default prediction modeling: an application of support vector machine

Credit default prediction (CDP) modeling is a fundamental and critical issue for financial institutions. However, the previous studies indicate that the classifier’s performances in CDP analysis differ using different performance criterions on different databases under different circumstances. The performance assessment exercise under a set of criteria remains understudied in nature, on the one hand, and the real–scenario is not taken into account in that a single/very limited number of measure only are used, on the other hand. These problems affect the ability to make a consistent conclusion. Therefore, the aim of this study is to address this methodological issue by applying support vector machine (SVM)-based CDP algorithm by means of a set of representative performance criterions, with enclosing some novel performance measures, its performance compare with the results gained by statistical and intelligent approaches using six different types of databases from the credit prediction domains. Experimental results show that SVM model is marginally superior to CART with DA, being more robust than its other counterparts. In consequence, this study recommends that the supremacy of a classifier is linked to the way in which evaluations are measured.

[1]  Tian-Shyug Lee,et al.  A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines , 2005, Expert Syst. Appl..

[2]  David Johnstone,et al.  An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes , 2015 .

[3]  Jake Ansell,et al.  Measuring retail company performance using credit scoring techniques , 2007, Eur. J. Oper. Res..

[4]  Luis Cuadros-Rodríguez,et al.  Quality performance metrics in multivariate classification methods for qualitative analysis , 2016 .

[5]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[6]  Abdallah Bashir Musa Comparative study on classification performance between support vector machine and logistic regression , 2012, International Journal of Machine Learning and Cybernetics.

[7]  Brani Vidakovic Probability, Conditional Probability, and Bayes’ Rule , 2011 .

[8]  Kin Keung Lai,et al.  Least squares support vector machines ensemble models for credit scoring , 2010, Expert Syst. Appl..

[9]  N. Wilson,et al.  Financial distress and bankruptcy prediction among listed companies using accounting, market and macroeconomic variables , 2013 .

[10]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[11]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[12]  José Hernández-Orallo,et al.  An experimental comparison of performance measures for classification , 2009, Pattern Recognit. Lett..

[13]  Brani Vidakovic,et al.  Statistics for Bioengineering Sciences , 2011 .

[14]  Brani Vidakovic,et al.  Statistics for Bioengineering Sciences: With MATLAB and WinBUGS Support , 2011 .

[15]  Mehdi Khashei,et al.  A novel hybrid classification model of artificial neural networks and multiple linear regression models , 2012, Expert Syst. Appl..

[16]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[17]  Mu-Yen Chen,et al.  Bankruptcy prediction in firms with statistical and intelligent techniques and a comparison of evolutionary computation approaches , 2011, Comput. Math. Appl..

[18]  B J Biggerstaff,et al.  Comparing diagnostic tests: a simple graphic using likelihood ratios. , 2000, Statistics in medicine.

[19]  D B Matchar,et al.  Noninvasive Carotid Artery Testing: A Meta-analytic Review , 1995, Annals of Internal Medicine.

[20]  Sudhir Nanda,et al.  Linear models for minimizing misclassification costs in bankruptcy prediction , 2001, Intell. Syst. Account. Finance Manag..

[21]  Soner Akkoç,et al.  An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish credit card data , 2012, Eur. J. Oper. Res..

[22]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[23]  Chunyan Miao,et al.  Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings , 2014, Neurocomputing.

[24]  Johan A. K. Suykens,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Bayesian Kernel-based Classification for Financial Distress Detection Dirk Van Den Poel 4 Bayesian Kernel Based Classification for Financial Distress Detection , 2022 .

[25]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[26]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[27]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[28]  Hui Li,et al.  Financial distress prediction using support vector machines: Ensemble vs. individual , 2012, Appl. Soft Comput..

[29]  J. Watada,et al.  Enhanced learning in neural networks and its application to financial statement analysis , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[30]  Ilya A. Strebulaev,et al.  Structural Models of Credit Risk are Useful: Evidence from Hedge Ratios on Corporate Bonds , 2004 .

[31]  Maysam F. Abbod,et al.  Classifiers consensus system approach for credit scoring , 2016, Knowl. Based Syst..