Credit risk prediction using support vector machines

The main purpose of this paper is to examine the relative performance between least-squares support vector machines and logistic regression models for default classification and default probability estimation. The financial ratios from a data set of more than 78,000 financial statements from 2000 to 2006 are used as default indicators. The main focus of this paper is on the influence of small training samples and high variance of the financial input data and the classification performance measured by the area under the receiver operating characteristic. The resolution and the reliability of the predicted default probabilities are evaluated by decompositions of the Brier score. It is shown that support vector machines significantly outperform logistic regression models, particularly under the condition of small training samples and high variance of the input data.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  H. Hersbach Decomposition of the Continuous Ranked Probability Score for Ensemble Prediction Systems , 2000 .

[3]  W. Härdle,et al.  Statistical Tools for Finance and Insurance , 2003 .

[4]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[5]  Wolfgang Härdle,et al.  Predicting Bankruptcy with Support Vector Machines , 2005 .

[6]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[7]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[8]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[9]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[10]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[11]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[12]  Amir F. Atiya,et al.  Bankruptcy prediction for credit risk using neural networks: A survey and new results , 2001, IEEE Trans. Neural Networks.

[13]  Liang-Hsuan Chen,et al.  A fuzzy credit-rating approach for commercial loans: a Taiwan case , 1999 .

[14]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[15]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[16]  Franco Varetto Genetic algorithms applications in the analysis of insolvency risk , 1998 .

[17]  J. Crook,et al.  Credit scoring using neural and evolutionary techniques , 2000 .

[18]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[19]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[20]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[21]  Kenneth Carling,et al.  Corporate credit risk modeling and the macroeconomy , 2007 .

[22]  Shigeo Abe Support Vector Machines for Pattern Classification , 2010, Advances in Pattern Recognition.

[23]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[24]  Yuh-Jye Lee,et al.  The Default Risk of Firms Examined with Smooth Support Vector Machines , 2007 .

[25]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[26]  Robert W. Faff,et al.  An integrated multi-model credit rating system for private firms , 2006 .

[27]  Tomaso A. Poggio,et al.  Regularization Networks and Support Vector Machines , 2000, Adv. Comput. Math..

[28]  Johan A. K. Suykens,et al.  Benchmarking Least Squares Support Vector Machine Classifiers , 2004, Machine Learning.

[29]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[30]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[31]  Wolfgang Karl Härdle,et al.  Estimation of Default Probabilities with Support Vector Machines , 2006 .

[32]  Lili Sun,et al.  A re-evaluation of auditors’ opinions versus statistical models in bankruptcy prediction , 2006 .