A Multiclass Machine Learning Approach to Credit Rating Prediction

Corporate credit ratings are important financial indicators of investment risks. Traditional credit rating models employ classical econometrics methods with heteroscedasticity adjustments across various industries. In this paper, we propose using machine learning techniques in predicting corporate ratings and demonstrate, empirically, that multiclass machine learning algorithms outperform traditional econometrics models in exact, 1-notch, or 2-notch away rating predictions. We use three years of CompuStat data from four very different industries and compare corporate credit rating prediction tasks across linear regression, ordered probit model, bagged decision tree with Laplace smoothing, multiclass support vector machines (SVM), and multiclass proximal support vector machines (PSVM). Our findings show that with the proper multiclass and heteroscedasticity adjustments, the computationally inexpensive multiclass PSVM can be utilized in making viable automated corporate credit rating systems for todaypsilas vast marketplace.

[1]  Marimuthu Palaniswami,et al.  Selecting bankruptcy predictors using a support vector machine approach , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[2]  W. Greene,et al.  计量经济分析 = Econometric analysis , 2009 .

[3]  Alvin J. Surkan,et al.  Neural networks for bond rating improved by multiple hidden layers , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[4]  Tao Xiong,et al.  A combined SVM and LDA approach for classification , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[5]  Constantin Zopounidis,et al.  A survey of business failures with an emphasis on prediction methods and industrial applications , 1996 .

[6]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[7]  Alan F. Murray,et al.  IEEE International Conference on Neural Networks , 1997 .

[8]  G. E. Pinches,et al.  A MULTIVARIATE ANALYSIS OF INDUSTRIAL BOND RATINGS , 1973 .

[9]  Gérard Dreyfus,et al.  Single-layer learning revisited: a stepwise procedure for building and training a neural network , 1989, NATO Neurocomputing.

[10]  Glenn Fung,et al.  Proximal support vector machine classifiers , 2001, KDD '01.

[11]  S. Dutta,et al.  Bond rating: a nonconservative application of neural networks , 1988, IEEE 1988 International Conference on Neural Networks.

[12]  Louis H. Ederington CLASSIFICATION MODELS AND BOND RATINGS , 1985 .

[13]  Glenn Fung,et al.  Multicategory Proximal Support Vector Machine Classifiers , 2005, Machine Learning.

[14]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[15]  Richard T. Redmond,et al.  Expert systems for bond rating: a comparative analysis of statistical, rule‐based and neural network systems , 1993 .

[16]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[17]  Soumitra Dutta,et al.  Bond rating: A non-conservative application of neural networks , 1988 .

[18]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[19]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[20]  Jeffrey S. Simonoff,et al.  Tree Induction Vs Logistic Regression: A Learning Curve Analysis , 2001, J. Mach. Learn. Res..

[21]  Ramakrishnan Srikant,et al.  Kdd-2001: Proceedings of the Seventh Acm Sigkdd International Conference on Knowledge Discovery and Data Mining : August 26-29, 2001 San Francisco, Ca, USA , 2002 .