Machine learning models and cost-sensitive decision trees for bond rating prediction

Abstract Since the outbreak of the financial crisis, the major global credit rating agencies have implemented significant changes to their methodologies to assess the sovereign credit risk. Therefore, bond rating prediction has become an interesting potential for investors and financial institutions. Previous research studies in this field have applied traditional statistical methods to develop models which provide prediction accuracy. However, no overall distinguished methods have been used in predicting bond ratings. Moreover, recent studies have suggested ensembles of classifiers that may be used in bond rating prediction. This article proposes an improved machine learning aimed to predict the rating of financial bonds. We empirically compare the classifiers ranging from logistic regression and discriminant analysis to nonparametric classifiers, such as support vector machine, neural networks, the cost-sensitive decision tree algorithm and deep neural networks. Three real-world bond rating data sets were selected to check the effectiveness and the viability of the set of the classifiers. The experimental results confirm that data mining methods can represent an alternative to the traditional prediction models of bond rating.

[1]  J. Maher,et al.  Earnings Predictability, Bond Ratings, and Bond Yields , 2005 .

[2]  Jakub M. Tomczak,et al.  Classification Restricted Boltzmann Machine for comprehensible credit scoring model , 2015, Expert Syst. Appl..

[3]  Raquel Florez-Lopez,et al.  Modelling of insurers’ rating determinants. An application of machine learning techniques and statistical models , 2007 .

[4]  Chulwoo Han,et al.  Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies , 2017, Expert Syst. Appl..

[5]  Feng-Chia Li,et al.  Combination of feature selection approaches with SVM in credit scoring , 2010, Expert Syst. Appl..

[6]  Tomohiro Ando,et al.  Bayesian corporate bond pricing and credit default swap premium models for deriving default probabilities and recovery rates , 2014, J. Oper. Res. Soc..

[7]  Ramazan Aktas,et al.  Prediction of bank financial strength ratings: The case of Turkey , 2012 .

[8]  Md Zahidul Islam,et al.  Novel algorithms for cost-sensitive classification and knowledge discovery in class imbalanced datasets with an application to NASA software defects , 2018, Inf. Sci..

[9]  Yang Wang,et al.  Cost-sensitive boosting for classification of imbalanced data , 2007, Pattern Recognit..

[10]  Carla E. Brodley,et al.  Pruning Decision Trees with Misclassification Costs , 1998, ECML.

[11]  Chih-Chuan Chen,et al.  Credit rating with a monotonicity-constrained support vector machine model , 2014, Expert Syst. Appl..

[12]  Pingfeng Wang,et al.  Failure diagnosis using deep belief learning based health state classification , 2013, Reliab. Eng. Syst. Saf..

[13]  Stefan Lessmann,et al.  Approaches for credit scorecard calibration: An empirical analysis , 2017, Knowl. Based Syst..

[14]  Sebastián Maldonado,et al.  Cost-based feature selection for Support Vector Machines: An application in credit scoring , 2017, Eur. J. Oper. Res..

[15]  Ligang Zhou,et al.  The performance of corporate financial distress prediction models with features selection guided by domain knowledge and data mining approaches , 2015, Knowl. Based Syst..

[16]  Terry Harris,et al.  Credit scoring using the clustered support vector machine , 2015, Expert Syst. Appl..

[17]  Philippe du Jardin,et al.  Dynamics of firm financial evolution and bankruptcy prediction , 2017, Expert Syst. Appl..

[18]  Francisco Herrera,et al.  An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics , 2013, Inf. Sci..

[19]  António Afonso,et al.  Sovereign Credit Ratings and Financial Markets Linkages: Application to European Data , 2011, SSRN Electronic Journal.

[20]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[21]  Yue Xu,et al.  Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets , 2018, Inf. Sci..

[22]  Kenji Doya,et al.  Expected energy-based restricted Boltzmann machine for classification , 2015, Neural Networks.

[23]  Maysam F. Abbod,et al.  Classifiers consensus system approach for credit scoring , 2016, Knowl. Based Syst..

[24]  Hedieh Sajedi,et al.  A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring , 2015 .

[25]  Md Zahidul Islam,et al.  Software defect prediction using a cost sensitive decision forest and voting, and a potential solution to the class imbalance problem , 2015, Inf. Syst..

[26]  Ersin Namli,et al.  Modelling sovereign credit ratings: The accuracy of models in a heterogeneous sample , 2016 .

[27]  Vadlamani Ravi,et al.  Differential evolution trained wavelet neural networks: Application to bankruptcy prediction in banks , 2009, Expert Syst. Appl..

[28]  Amir F. Atiya,et al.  Bankruptcy prediction for credit risk using neural networks: A survey and new results , 2001, IEEE Trans. Neural Networks.

[29]  Arjana Brezigar-Masten,et al.  CART-based selection of bankruptcy predictors for the logit model , 2012, Expert Syst. Appl..

[30]  Y. Liu,et al.  Data mining feature selection for credit scoring models , 2005, J. Oper. Res. Soc..

[31]  Francisco Javier García Castellano,et al.  Expert Systems With Applications , 2022 .

[32]  Diego Klabjan,et al.  Implementing deep neural networks for financial market prediction on the Intel Xeon Phi , 2015, WHPCF@SC.

[33]  Sergio L. Schmukler,et al.  Emerging Markets Instability: Do Sovereign Ratings Affect Country Risk and Stock Returns? , 1999 .

[34]  Ning Chen,et al.  A genetic algorithm-based approach to cost-sensitive bankruptcy prediction , 2011, Expert Syst. Appl..

[35]  Robert P. W. Duin,et al.  Bagging, Boosting and the Random Subspace Method for Linear Classifiers , 2002, Pattern Analysis & Applications.

[36]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[37]  Wei Yang,et al.  Reject inference in credit scoring using Semi-supervised Support Vector Machines , 2017, Expert Syst. Appl..

[38]  Björn E. Ottersten,et al.  Example-dependent cost-sensitive decision trees , 2015, Expert Syst. Appl..

[39]  M. Zmijewski METHODOLOGICAL ISSUES RELATED TO THE ESTIMATION OF FINANCIAL DISTRESS PREDICTION MODELS , 1984 .

[40]  So Young Sohn,et al.  Technology credit scoring model with fuzzy logistic regression , 2016, Appl. Soft Comput..

[41]  Nicolas Huck,et al.  Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 , 2017, Eur. J. Oper. Res..

[42]  O. Gwilym,et al.  Sovereign rating actions and the implied volatility of stock index options , 2014 .

[43]  Mu-Yen Chen,et al.  Bankruptcy prediction in firms with statistical and intelligent techniques and a comparison of evolutionary computation approaches , 2011, Comput. Math. Appl..

[44]  Gerald Schaefer,et al.  Cost-sensitive decision tree ensembles for effective imbalanced classification , 2014, Appl. Soft Comput..

[45]  Eric Séverin,et al.  An investigation of bankruptcy prediction in imbalanced datasets , 2018, Decis. Support Syst..

[46]  David A. Elizondo,et al.  Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks , 2008, Decis. Support Syst..

[47]  Desheng Dash Wu,et al.  A deep learning approach for credit scoring using credit default swaps , 2017, Eng. Appl. Artif. Intell..

[48]  Mehtap Hisarciklilar,et al.  Sovereign Risk Ratings: Biased Toward Developed Countries? , 2011 .

[49]  Yufei Xia,et al.  A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring , 2017, Expert Syst. Appl..

[50]  Nebojsa Nikolic,et al.  The application of brute force logistic regression to corporate credit scoring models: Evidence from Serbian financial statements , 2013, Expert Syst. Appl..

[51]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[52]  Roneel V. Sharan,et al.  Robust acoustic event classification using deep neural networks , 2017, Inf. Sci..

[53]  Yufei Xia,et al.  A novel heterogeneous ensemble credit scoring model based on bstacking approach , 2018, Expert Syst. Appl..

[54]  El-Sayed M. El-Horbaty,et al.  Classification using deep learning neural networks for brain tumors , 2017, Future Computing and Informatics Journal.

[55]  Hui Li,et al.  AdaBoost ensemble for financial distress prediction: An empirical comparison with data from Chinese listed companies , 2011, Expert Syst. Appl..

[56]  Gordon V. Karels,et al.  Multivariate Normality and Forecasting of Business Bankruptcy , 1987 .

[57]  J. Patell,et al.  The Experimental Design of Classification Models: An Application of Recursive Partitioning and Bootstrapping to Commercial Bank Loan Classifications , 1984 .

[58]  Robert Brooks,et al.  The National Market Impact of Sovereign Rating Changes , 2001 .

[59]  José Salvador Sánchez,et al.  Two-level classifier ensembles for credit risk assessment , 2012, Expert Syst. Appl..

[60]  Yi-Chung Hu,et al.  Comparing four bankruptcy prediction models: Logit, quadratic interval logit, neural and fuzzy neural networks , 2010, Expert Syst. Appl..

[61]  Hong Zhao,et al.  A cost sensitive decision tree algorithm with two adaptive mechanisms , 2015, Knowl. Based Syst..

[62]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[63]  Jian Ma,et al.  An improved SMO algorithm for financial credit risk assessment - Evidence from China's banking , 2018, Neurocomputing.

[64]  Hui Li,et al.  Predicting business failure using classification and regression tree: An empirical comparison with popular classical statistical methods and top classification mining methods , 2010, Expert Syst. Appl..

[65]  Pablo Moscato,et al.  Deep neural networks understand investors better , 2018, Decis. Support Syst..

[66]  I. Hasan,et al.  Financial Crises and Bank Failures: A Review of Prediction Methods , 2009 .

[67]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[68]  David L. Olson,et al.  Comparative analysis of data mining methods for bankruptcy prediction , 2012, Decis. Support Syst..

[69]  Patricio Valenzuela,et al.  Sovereign Ceilings 'Lite'? The Impact of Sovereign Ratings on Corporate Ratings in Emerging Market Economies , 2007, SSRN Electronic Journal.

[70]  Charles X. Ling,et al.  Hybrid Cost-Sensitive Decision Tree , 2005, PKDD.

[71]  Zijiang Yang,et al.  Using partial least squares and support vector machines for bankruptcy prediction , 2011, Expert Syst. Appl..

[72]  David Johnstone,et al.  An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes , 2015 .

[73]  Philippe du Jardin,et al.  Predicting bankruptcy using neural networks and other classification methods: The influence of variable selection techniques on model accuracy , 2010, Neurocomputing.

[74]  Kai Ming Ting,et al.  An Instance-weighting Method to Induce Cost-sensitive Trees , 2001 .

[75]  O. Gwilym,et al.  Foreign exchange market reactions to sovereign credit news , 2012 .

[76]  Jin Liang,et al.  Utility indifference valuation of corporate bond with credit rating migration by structure approach , 2016 .

[77]  Ivor W. Tsang,et al.  Core Vector Machines: Fast SVM Training on Very Large Data Sets , 2005, J. Mach. Learn. Res..

[78]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[79]  Oscar Camacho Nieto,et al.  The Naïve Associative Classifier (NAC): A novel, simple, transparent, and accurate classification model evaluated on financial data , 2017, Neurocomputing.

[80]  Zhao Wang,et al.  A Novel behavioral scoring model for estimating probability of default over time in peer-to-peer lending , 2018, Electron. Commer. Res. Appl..

[81]  Maria H. Kim,et al.  Information asymmetry and investor trading behavior around bond rating change announcements , 2017 .

[82]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Deep learning for biological image classification , 2017, Expert Syst. Appl..

[83]  Ayman M. Eldeib,et al.  Breast cancer classification using deep belief networks , 2016, Expert Syst. Appl..

[84]  Elena Fedorova,et al.  Bankruptcy prediction for Russian companies: Application of combined classifiers , 2013, Expert Syst. Appl..

[85]  Ruibin Geng,et al.  Prediction of financial distress: An empirical study of listed Chinese companies using data mining , 2015, Eur. J. Oper. Res..

[86]  Bart Baesens,et al.  Credit rating prediction using Ant Colony Optimization , 2010, J. Oper. Res. Soc..

[87]  Chengqi Zhang,et al.  Cost-sensitive classification with inadequate labeled data , 2012, Inf. Syst..

[88]  Herbert Kimura,et al.  Machine learning models and bankruptcy prediction , 2017, Expert Syst. Appl..

[89]  Yu Wang,et al.  Ensemble classification based on supervised clustering for credit scoring , 2016, Appl. Soft Comput..

[90]  Rahib H. Abiyev,et al.  Credit Rating Using Type-2 Fuzzy Neural Networks , 2014 .

[91]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[92]  ChongEunsuk,et al.  Deep learning networks for stock market analysis and prediction , 2017 .

[93]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[94]  José Ramón Cano,et al.  Prototype selection to improve monotonic nearest neighbor , 2017, Eng. Appl. Artif. Intell..

[95]  Joaquín Abellán,et al.  Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring , 2014, Expert Syst. Appl..

[96]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[97]  Philippe du Jardin,et al.  A two-stage classification technique for bankruptcy prediction , 2016, Eur. J. Oper. Res..

[98]  John D. Jackson,et al.  A statistical approach to modeling the behavior of bond raters , 1988 .

[99]  Christophe Croux,et al.  Sovereign credit rating determinants: A comparison before and after the European debt crisis , 2017 .

[100]  Pedro Gomes,et al.  Sovereign credit ratings, market volatility, and financial gains , 2014, Comput. Stat. Data Anal..

[101]  Carlos Serrano-Cinca,et al.  Partial Least Square Discriminant Analysis for bankruptcy prediction , 2013, Decis. Support Syst..

[102]  Ekrem Duman,et al.  A cost-sensitive decision tree approach for fraud detection , 2013, Expert Syst. Appl..

[103]  Buket D. Barkana,et al.  Age and gender classification from speech and face images by jointly fine-tuned deep neural networks , 2017, Expert Syst. Appl..

[104]  Muhammad Awais,et al.  Medical image retrieval using deep convolutional neural network , 2017, Neurocomputing.

[105]  Paulius Danenas,et al.  Selection of Support Vector Machines based classifiers for credit risk domain , 2015, Expert Syst. Appl..

[106]  Lijuan Cao,et al.  Bond rating using support vector machine , 2006, Intell. Data Anal..