Hierarchical Multi-Scale Gaussian Transformer for Stock Movement Prediction

Predicting the price movement of finance securities like stocks is an important but challenging task, due to the uncertainty of financial markets. In this paper, we propose a novel approach based on the Transformer to tackle the stock movement prediction task. Furthermore, we present several enhancements for the proposed basic Transformer. Firstly, we propose a Multi-Scale Gaussian Prior to enhance the locality of Transformer. Secondly, we develop an Orthogonal Regularization to avoid learning redundant heads in the multi-head self-attention mechanism. Thirdly, we design a Trading Gap Splitter for Transformer to learn hierarchical features of high-frequency finance data. Compared with other popular recurrent neural networks such as LSTM, the proposed method has the advantage to mine extremely long-term dependencies from financial time series. Experimental results show our proposed models outperform several competitive methods in stock price prediction tasks for the NASDAQ exchange market and the China A-shares market.

[1]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[2]  M. V. Rossum,et al.  In Neural Computation , 2022 .

[3]  Maosong Sun,et al.  Enhancing Stock Movement Prediction with Adversarial Training , 2018, IJCAI.

[4]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[5]  Gary Geunbae Lee,et al.  Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2012, ACL 2012.

[6]  Dongyan Zhao,et al.  Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism , 2018, IJCAI.

[7]  Charu C. Aggarwal,et al.  Stock Price Prediction via Discovering Multi-Frequency Trading Patterns , 2017, KDD.

[8]  Adriano C. M. Pereira,et al.  Stock market's price movement prediction with LSTM neural networks , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[9]  Hongwei Zhu,et al.  CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model with Attention for Predicting Trends of Financial Markets , 2019, IJCAI.

[10]  Taylor Francis Online,et al.  The American statistician , 1947 .

[11]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[12]  Ruihan Bao,et al.  Incorporating Fine-grained Events in Stock Movement Prediction , 2019, EMNLP.

[13]  Fadel M. Megahed,et al.  Stock market one-day ahead movement prediction using disparate data sources , 2017, Expert Syst. Appl..

[14]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[15]  Jinkyu Lee,et al.  Orthogonality Constrained Multi-Head Attention for Keyword Spotting , 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).

[16]  Ting Liu,et al.  Gaussian Transformer: A Lightweight Approach for Natural Language Inference , 2019, AAAI.

[17]  Jinglu Hu,et al.  An SVM-based approach for stock market trend prediction , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[18]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[19]  Steven Walczak,et al.  An Empirical Analysis of Data Requirements for Financial Forecasting with Neural Networks , 2001, J. Manag. Inf. Syst..

[20]  Wenhu Chen,et al.  Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting , 2019, NeurIPS.

[21]  Jian Li,et al.  Multi-Head Attention with Disagreement Regularization , 2018, EMNLP.

[22]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.