Applying BERT to analyze investor sentiment in stock market

This paper is an analysis of investor sentiment in the stock market based on the bidirectional encoder representations from transformers (BERT) model. First, we extracted the sentiment value from online information published by stock investor, using the Bert model. Second, these sentiment values were weighted by attention for computing the investor sentiment indicator. Finally, the relationship between investor sentiment and stock yield was analyzed through a two-step cross-sectional regression validation model. The experiments found that investor sentiment in online reviews had a significant impact on stock yield. The experiments show that the Bert model used in this paper can achieve an accuracy of 97.35% for the analysis of investor sentiment, which is better than both LSTM and SVM methods.

[1]  Martin E. Zweig AN INVESTOR EXPECTATIONS STOCK PRICE PREDICTIVE MODEL USING CLOSED‐END FUND PREMIUMS , 1973 .

[2]  Kyoung-Sook Moon,et al.  Performance of Deep Learning in Prediction of Stock Market Volatility , 2019, ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH.

[3]  Malcolm P. Baker,et al.  Investor Sentiment and the Cross-Section of Stock Returns , 2003 .

[4]  Maik Schmeling Investor sentiment and stock returns: Some international evidence , 2009 .

[5]  Kenneth L. Fisher,et al.  Investor Sentiment and Stock Returns , 2000 .

[6]  Yang Yang,et al.  Stock closing price prediction based on sentiment analysis and LSTM , 2019, Neural Computing and Applications.

[7]  Rob J Hyndman,et al.  25 Years of Iif Time Series Forecasting: A Selective Review , 2005 .

[8]  B. Swaminathan,et al.  Time-Varying Expected Small Firm Returns and Closed-End Fund Discounts , 1996 .

[9]  Michael T. Cliff,et al.  Investor Sentiment and the Near-Term Stock Market , 2001 .

[10]  San-Lin Chung,et al.  When does investor sentiment predict stock returns , 2012 .

[11]  Yong Huang,et al.  Investor sentiment and stock price: Empirical evidence from Chinese SEOs , 2021 .

[12]  D. Zhang,et al.  High-speed Train Control System Big Data Analysis Based on the Fuzzy RDF model and Uncertain Reasoning , 2017, Int. J. Comput. Commun. Control.

[13]  S. Rinaldi,et al.  Positive Linear Systems: Theory and Applications , 2000 .

[14]  Ellen Riloff,et al.  Learning subjective nouns using extraction pattern bootstrapping , 2003, CoNLL.

[15]  Xiaojun Wan,et al.  Co-Training for Cross-Lingual Sentiment Classification , 2009, ACL.

[16]  Alexander Kurov,et al.  Investor Sentiment and the Stock Market’s Reaction to Monetary Policy , 2009 .

[17]  R. Thaler,et al.  Investor Sentiment and the Closed-End Fund Puzzle , 1990 .

[18]  S Roh,et al.  Production of HanWoo (Bos taurus coreanae) fetuses following interbreed somatic cell nuclear transfer. , 2001, The Journal of veterinary medical science.