Credit Rating Change Modeling Using News and Financial Ratios

Credit ratings convey credit risk information to participants in financial markets, including investors, issuers, intermediaries, and regulators. Accurate credit rating information plays a crucial role in supporting sound financial decision-making processes. Most previous studies on credit rating modeling are based on accounting and market information. Text data are largely ignored despite the potential benefit of conveying timely information regarding a firm’s outlook. To leverage the additional information in news full-text for credit rating prediction, we designed and implemented a news full-text analysis system that provides firm-level coverage, topic, and sentiment variables. The novel topic-specific sentiment variables contain a large fraction of missing values because of uneven news coverage. The missing value problem creates a new challenge for credit rating prediction approaches. We address this issue by developing a missing-tolerant multinomial probit (MT-MNP) model, which imputes missing values based on the Bayesian theoretical framework. Our experiments using seven and a half years of real-world credit ratings and news full-text data show that (1) the overall news coverage can explain future credit rating changes while the aggregated news sentiment cannot; (2) topic-specific news coverage and sentiment have statistically significant impact on future credit rating changes; (3) topic-specific negative sentiment has a more salient impact on future credit rating changes compared to topic-specific positive sentiment; (4) MT-MNP performs better in predicting future credit rating changes compared to support vector machines (SVM). The performance gap as measured by macroaveraging F-measure is small but consistent.

[1]  Praveen Pathak,et al.  Making words work: Using financial text as a predictor of financial events , 2010, Decis. Support Syst..

[2]  Thomas E. Nichols Tools for statistical inference in functional & structural brain imaging , 2009 .

[3]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[4]  J. Neter,et al.  Applied Linear Regression Models , 1983 .

[5]  D. Hensher,et al.  Predicting Firm Financial Distress: A Mixed Logit Model , 2004 .

[6]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[7]  Zhu Zhang,et al.  Deciphering word-of-mouth in social media: Text-based metrics of consumer reviews , 2012, TMIS.

[8]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[9]  Werner Antweiler,et al.  Is All that Talk Just Noise? The Information Content of Internet Stock Message Boards , 2001 .

[10]  James A. Ohlson FINANCIAL RATIOS AND THE PROBABILISTIC PREDICTION OF BANKRUPTCY , 1980 .

[11]  R. C. Merton,et al.  On the Pricing of Corporate Debt: The Risk Structure of Interest Rates , 1974, World Scientific Reference on Contingent Claims Analysis in Corporate Finance.

[12]  Chris Stewart,et al.  A note comparing support vector machines and ordered choice models' predictions of international banks' ratings , 2011, Decis. Support Syst..

[13]  Donald P. Cram,et al.  Assessing the Probability of Bankruptcy , 2004 .

[14]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[15]  W. Greene,et al.  计量经济分析 = Econometric analysis , 2009 .

[16]  Joel Peress,et al.  Media Coverage and the Cross-Section of Stock Returns , 2008 .

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[19]  Antonio Afonso,et al.  Ordered response models for sovereign debt ratings , 2006 .

[20]  F. T. Magiera Forecasting Bankruptcy More Accurately: A Simple Hazard Model , 2001 .

[21]  Xiao-Li Meng,et al.  Seeking efficient data augmentation schemes via conditional and marginal augmentation , 1999 .

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[24]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[25]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[26]  Vineet Agarwal,et al.  Comparing the Performance of Market-Based and Accounting-Based Bankruptcy Prediction Models , 2006 .

[27]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[28]  Sofus A. Macskassy,et al.  More than Words: Quantifying Language to Measure Firms' Fundamentals the Authors Are Grateful for Assiduous Research Assistance from Jie Cao and Shuming Liu. We Appreciate Helpful Comments From , 2007 .

[29]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[30]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[31]  Raymond Y. K. Lau,et al.  Text mining and probabilistic language modeling for online review spam detection , 2012, TMIS.

[32]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[33]  Joseph G. Ibrahim,et al.  Bayesian methods for generalized linear models with covariates missing at random , 2002 .

[34]  D. V. Dyk,et al.  A Bayesian analysis of the multinomial probit model using marginal data augmentation , 2005 .

[35]  Paul Hanouna,et al.  journal homepage: www.elsevier.com/locate/jbf , 2022 .

[36]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[37]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[38]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[39]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Cheng-Few Lee,et al.  On multiple-class prediction of issuer credit ratings , 2009 .

[41]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[42]  Hsinchun Chen,et al.  CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication , 2008, MIS Q..

[43]  Edward I. Altman,et al.  Corporate Financial Distress and Bankruptcy: Predict and Avoid Bankruptcy, Analyze and Invest in Distressed Debt , 2005 .

[44]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[45]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[46]  Chenchuramaiah T. Bathala Giving Content to Investor Sentiment: The Role of Media in the Stock Market , 2007 .

[47]  Viral V. Acharya,et al.  Credit Risk: Pricing, Measurement, and Management , 2005 .

[48]  Matthias W. Uhl Explaining U.S. consumer behavior with news sentiment , 2011, TMIS.

[49]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[50]  S. Kothari,et al.  The Effect of Disclosures by Management, Analysts, and Business Press on Cost of Capital, Return Volatility, and Analyst Forecasts: A Study Using Content Analysis , 2009 .

[51]  W. S. Chan,et al.  Stock Price Reaction to News and No-News: Drift and Reversal after Headlines , 2001 .

[52]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[53]  C. K. Chu,et al.  Predicting issuer credit ratings using a semiparametric method , 2010 .

[54]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[55]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[56]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[57]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[58]  Sreedhar T. Bharath,et al.  Forecasting Default with the Merton Distance to Default Model , 2008 .

[59]  Xue Bai,et al.  Predicting consumer sentiments from online text , 2011, Decis. Support Syst..

[60]  Hsinchun Chen,et al.  Giving context to accounting numbers: The role of news coverage , 2011, Decis. Support Syst..

[61]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .