Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz type extension of reward functions defined in metric spaces. Specifically, the McShane and Whitney extensions are considered for a reward function which is defined by the total evaluation of the benefits produced by the investment decision at a given time. We define the metric as a linear combination of a Euclidean distance and an angular metric component. All information about the evolution of the system from the beginning of the time interval is used to support the extension of the reward function, but in addition this data set is enriched by adding some artificially produced states. Thus, the main novelty of our method is the way we produce more states -- which we call "dreams" -- to enrich learning. Using some known states of the dynamical system that represents the evolution of the financial market, we use our technique to simulate new states by interpolating real states and introducing some random variables. These new states are used to feed a learning algorithm designed to improve the investment strategy by following a typical reinforcement learning scheme.

[1]  Liu Quan,et al.  Financial time series forecasting using LPP and SVM optimized by PSO , 2013, SOCO 2013.

[2]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[3]  Algirdas Maknickas,et al.  Investigation of financial market prediction by recurrent neural network , 2011 .

[4]  Donald Michie,et al.  Learning concepts from data , 1998 .

[5]  Dong Gu Choi,et al.  An intelligent financial portfolio trading strategy using deep Q-learning , 2019, Expert Syst. Appl..

[6]  Pengjian Shang,et al.  Compositional segmentation of time series in the financial markets , 2015, Appl. Math. Comput..

[7]  Morteza Esfandyari,et al.  Stock Market Index Prediction Using Artificial Neural Network , 2016 .

[8]  Dietmar Maringer,et al.  Regime-switching recurrent reinforcement learning for investment decision making , 2012, Comput. Manag. Sci..

[9]  Chulwoo Han,et al.  Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies , 2017, Expert Syst. Appl..

[10]  Dan A. Simovici,et al.  Mathematical Analysis for Machine Learning and Data Mining , 2018 .

[11]  Hong Jia,et al.  A new distance metric for unsupervised learning of categorical data , 2014, IEEE International Joint Conference on Neural Network.

[12]  Salim Lahmiri,et al.  A variational mode decompoisition approach for analysis and forecasting of economic and financial time series , 2016, Expert Syst. Appl..

[13]  Yang Wu,et al.  Metric Learning via Maximizing the Lipschitz Margin Ratio , 2018, ArXiv.

[14]  Adriano Lorena Inácio de Oliveira,et al.  Expert Systems With Applications , 2022 .

[15]  Xiaolong Wang,et al.  A support vector machine based MSM model for financial short-term volatility forecasting , 2011, Neural Computing and Applications.

[16]  Bill McDonald,et al.  Textual Analysis in Accounting and Finance: A Survey , 2016 .

[17]  Yao Yu,et al.  Fixed-Time Event-Triggered Consensus for Nonlinear Multiagent Systems Without Continuous Communications , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[18]  Weiguo Zhang,et al.  An evolutionary trend reversion model for stock trading rule discovery , 2015, Knowl. Based Syst..

[19]  Lee-Ad Gottlieb,et al.  Nearly Optimal Classification for Semimetrics , 2015, AISTATS.

[20]  Shom Prasad Das,et al.  A novel hybrid model using teaching–learning-based optimization and a support vector machine for commodity futures index forecasting , 2015, International Journal of Machine Learning and Cybernetics.

[21]  E. J. McShane,et al.  Extension of range of functions , 1934 .

[22]  M A H Dempster,et al.  An automated FX trading system using adaptive reinforcement learning , 2006, Expert Syst. Appl..

[23]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[24]  Ammar Belatreche,et al.  Evaluating machine learning classification for financial trading: An empirical approach , 2016, Expert Syst. Appl..

[25]  Nicolas Huck,et al.  Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 , 2017, Eur. J. Oper. Res..

[26]  Ha Young Kim,et al.  Improving financial trading decisions using deep Q-learning: Predicting the number of shares, action strategies, and transfer learning , 2019, Expert Syst. Appl..

[27]  Youyong Kong,et al.  Deep Direct Reinforcement Learning for Financial Signal Representation and Trading , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Zhengyao Jiang,et al.  Cryptocurrency portfolio management with deep reinforcement learning , 2016, 2017 Intelligent Systems Conference (IntelliSys).

[29]  Charalambos D. Aliprantis,et al.  Locally Solid Riesz Spaces with Applications to Economics , 2003 .

[30]  So Young Sohn,et al.  Global stock market investment strategies based on financial network indicators using machine learning techniques , 2019, Expert Syst. Appl..

[31]  Colm Kearney,et al.  Textual Sentiment in Finance: A Survey of Methods and Models , 2013 .

[32]  Daniel A. Spielman,et al.  Algorithms for Lipschitz Learning on Graphs , 2015, COLT.

[33]  Jonathan L. Ticknor A Bayesian regularized artificial neural network for stock market forecasting , 2013, Expert Syst. Appl..

[34]  Parag C. Pendharkar,et al.  Trading financial indices with reinforcement learning agents , 2018, Expert Syst. Appl..

[35]  Hong Jia,et al.  A New Distance Metric for Unsupervised Learning of Categorical Data , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Pablo Moscato,et al.  Deep neural networks understand investors better , 2018, Decis. Support Syst..

[37]  Michael Defoort,et al.  Leader-follower fixed-time consensus for multi-agent systems with unknown non-linear inherent dynamics , 2015 .

[38]  Kavosh Asadi,et al.  Lipschitz Continuity in Model-based Reinforcement Learning , 2018, ICML.

[39]  Marco Corazza,et al.  Testing different Reinforcement Learning con?gurations for ?nancial trading: Introduction and applications , 2018 .

[40]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[41]  Jun Wang,et al.  Fluctuation prediction of stock market index by Legendre neural network with random time strength function , 2012, Neurocomputing.

[42]  Ralf Korn,et al.  Option Pricing and Portfolio Optimization: Modern Methods of Financial Mathematics , 2001 .

[43]  Liu Yang,et al.  On neural networks and learning systems for business computing , 2018, Neurocomputing.

[44]  Thomas Gärtner,et al.  Graph kernels and Gaussian processes for relational reinforcement learning , 2006, Machine-mediated learning.

[45]  Stelios D. Bekiros,et al.  Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach , 2010 .

[46]  Quanmin Zhu,et al.  Stable adaptive neurocontrol for nonlinear discrete-time systems , 2004, IEEE Trans. Neural Networks.

[47]  Chih-Chou Chiu,et al.  Financial time series forecasting using independent component analysis and support vector regression , 2009, Decis. Support Syst..

[48]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[49]  Shie-Jue Lee,et al.  A multiple-kernel support vector regression approach for stock market price forecasting , 2011, Expert Syst. Appl..

[50]  Jonghun Park,et al.  A Multiagent Approach to $Q$-Learning for Daily Stock Trading , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[51]  Ulrike von Luxburg,et al.  Distance-Based Classification with Lipschitz Functions , 2004, J. Mach. Learn. Res..

[52]  Sahil Shah,et al.  Predicting stock market index using fusion of machine learning techniques , 2015, Expert Syst. Appl..

[53]  Yongtao Hao,et al.  A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction , 2017, Expert Syst. Appl..

[54]  Fang Liao,et al.  Distributed adaptive control for vehicular platoon with unknown dead‐zone inputs and velocity/acceleration disturbances , 2017 .

[55]  Stelios Bekiros,et al.  Heuristic learning in intraday trading under uncertainty , 2015 .

[56]  Salvador Romaguera,et al.  Semi-Lipschitz Functions and Best Approximation in Quasi-Metric Spaces , 2000 .

[57]  V. A. Mil'man Absolutely minimal extensions of functions on metric spaces , 1999 .

[58]  A. Murat Ozbayoglu,et al.  Algorithmic financial trading with deep convolutional neural networks: Time series to image conversion approach , 2018, Appl. Soft Comput..

[59]  Dietmar Maringer,et al.  Using a Genetic Algorithm to Improve Recurrent Reinforcement Learning for Equity Trading , 2015, Computational Economics.

[60]  Jan-P. Calliess,et al.  Lipschitz optimisation for Lipschitz Interpolation , 2017, 2017 American Control Conference (ACC).

[61]  G. Aronsson Extension of functions satisfying lipschitz conditions , 1967 .

[62]  Gisele L. Pappa,et al.  From an artificial neural network to a stock market day-trading system: A case study on the BM&F BOVESPA , 2009, 2009 International Joint Conference on Neural Networks.

[63]  Huaguang Zhang,et al.  General value iteration based reinforcement learning for solving optimal tracking control problem of continuous-time affine nonlinear systems , 2017, Neurocomputing.

[64]  Steve Y. Yang,et al.  An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown , 2017, Expert Syst. Appl..

[65]  David de la Fuente,et al.  Forecasting IBEX-35 moves using support vector machines , 2012, Neural Computing and Applications.

[66]  Bert Huang,et al.  Learning a Distance Metric from a Network , 2011, NIPS.

[67]  Lee-Ad Gottlieb,et al.  Efficient Classification for Metric Data , 2014, IEEE Trans. Inf. Theory.

[68]  Matthew Saffell,et al.  Learning to trade via direct reinforcement , 2001, IEEE Trans. Neural Networks.

[69]  Xiaolong Wang,et al.  A novel text mining approach to financial time series forecasting , 2012, Neurocomputing.

[70]  Ying Wah Teh,et al.  Text mining for market prediction: A systematic review , 2014, Expert Syst. Appl..

[71]  Michal Tkác,et al.  Artificial neural networks in business: Two decades of research , 2016, Appl. Soft Comput..

[72]  Thomas Gärtner,et al.  Graph kernels and Gaussian processes for relational reinforcement learning , 2006, Machine Learning.

[73]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.