Improving corporate bond recovery rate prediction using multi-factor support vector regressions

Abstract In the multi-factor framework described in this paper, we use instrument-specific characteristics, several macroeconomic variables, and industry-specific characteristics as our explanatory variables for predicting recovery rates for corporate bonds. By including the principal components derived from a large number of macroeconomic variables, all three least-squares support vector regression methods, as well as the ordinary linear regression, exhibit higher out-of-sample predictive accuracy than the models that included only the few macroeconomic variables suggested in the literature. We compare the prediction accuracies of all techniques by incorporating sparse principal components, nonlinear principal components from an auto-associative neural network, and kernel principal components. Our results show that sparse principal components generate more interpretable and accurate estimations compared to the other principal component techniques. Moreover, we apply gradient boosting to generate a ranking of the 104 macroeconomic variables, from best to worst, based on their prediction power in recovery rate estimation. The three categories with the most informative macroeconomic predictors are micro-level factors, business cycle variables, and stock market indicators.

[1]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[2]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[3]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.

[4]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[5]  Stephan K. Chalup,et al.  Kernel Methods in Finance , 2008 .

[6]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[7]  Jonathan Crook,et al.  Support vector machines for credit scoring and discovery of significant features , 2009, Expert Syst. Appl..

[8]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[9]  E. Altman,et al.  Almost Everything You Wanted to Know about Recoveries on Defaulted Bonds , 1996 .

[10]  Richard Cantor,et al.  Determinants of Recovery Rates on Defaulted Bonds and Loans for North American Corporate Issuers , 2004 .

[11]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[12]  Siem Jan Koopman,et al.  Modeling Frailty-Correlated Defaults Using Many Macroeconomic Covariates , 2009 .

[13]  M. Qi,et al.  Comparison of modeling methods for Loss Given Default , 2011 .

[14]  B. Baesens,et al.  Benchmarking regression algorithms for loss given default modeling , 2012 .

[15]  João A. Bastos Forecasting bank loans loss-given-default , 2010 .

[16]  M. Zenga,et al.  Bank loan recovery rates: Measuring and nonparametric density estimation , 2010 .

[17]  D. Duffie,et al.  Frailty Correlated Default , 2006 .

[18]  M. Subrahmanyam,et al.  The Determinants of Recovery Rates in the US Corporate Bond Market , 2014 .

[19]  E. Altman,et al.  Ultimate recovery mixtures , 2014 .

[20]  Rasmus Larsen,et al.  SpaSM: A MATLAB Toolbox for Sparse Statistical Modeling , 2018 .

[21]  Sreedhar T. Bharath,et al.  Does Industry-Wide Distress Affect Defaulted Firms? Evidence from Creditor Recoveries , 2007 .

[22]  Jonathan Crook,et al.  Support vector regression for loss given default modelling , 2015, Eur. J. Oper. Res..

[23]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[24]  William W. Hsieh,et al.  Nonlinear principal component analysis of noisy data , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[25]  William W. Hsieh Nonlinear principal component analysis of noisy data , 2007, Neural Networks.

[26]  Bart Baesens,et al.  Forecasting Loss Given Default models: impact of account characteristics and the macroeconomic state , 2014, J. Oper. Res. Soc..

[27]  Daniel Rösch,et al.  Forecasting probabilities of default and loss rates given default in the presence of selection , 2012, J. Oper. Res. Soc..

[28]  Paulius Danenas,et al.  Selection of Support Vector Machines based classifiers for credit risk domain , 2015, Expert Syst. Appl..

[29]  Andrea Resti,et al.  The Link between Default and Recovery Rates: Theory, Empirical Evidence and Implications , 2003 .