Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach

The present study addresses the learning mechanism of boundedly rational agents in the dynamic and noisy environment of financial markets. The main objective is the development of a system that "decodes" the knowledge-acquisition strategy and the decision-making process of technical analysts called "chartists". It advances the literature on heterogeneous learning in speculative markets by introducing a trading system wherein market environment and agent beliefs are represented by fuzzy inference rules. The resulting functionality leads to the derivation of the parameters of the fuzzy rules by means of adaptive training. In technical terms, it expands the literature that has utilized Actor-Critic reinforcement learning and fuzzy systems in agent-based applications, by presenting an adaptive fuzzy reinforcement learning approach that provides with accurate and prompt identification of market turning points and thus higher predictability. The purpose of this paper is to illustrate this concretely through a comparative investigation against other well-established models. The results indicate that with the inclusion of transaction costs, the profitability of the novel system in case of NASDAQ Composite, FTSE100 and NIKKEI255 indices is consistently superior to that of a Recurrent Neural Network, a Markov-switching model and a Buy and Hold strategy. Overall, the proposed system via the reinforcement learning mechanism, the fuzzy rule-based state space modeling and the adaptive action selection policy, leads to superior predictions upon the direction-of-change of the market.

[1]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[2]  Nigel Harvey,et al.  Judging the probability that the next point in an observed time series will be below, or above, a given value , 1995 .

[3]  Stephen Yurkovich,et al.  Fuzzy Control , 1997 .

[4]  C. Hommes Heterogeneous Agent Models in Economics and Finance , 2005 .

[5]  Ken-ichi Funahashi,et al.  On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.

[6]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[7]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[8]  Masafumi Hagiwara,et al.  Fuzzy inference neural network , 1997, Neurocomputing.

[9]  Gene H. Golub,et al.  Matrix computations , 1983 .

[10]  Hamid R. Berenji,et al.  Refinement of Approximate Reasoning-based Controllers by Reinforcement Learning , 1991, ML.

[11]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[12]  Douglas M. Patterson,et al.  Nonlinear Dynamics, Chaos, And Instability , 1994 .

[13]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[14]  C. Watkins Learning from delayed rewards , 1989 .

[15]  D. Rumelhart,et al.  Generalization by weight-elimination applied to currency exchange rate prediction , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[16]  Leigh Tesfatsion,et al.  Handbook of Computational Economics, Volume 2: Agent-Based Computational Economics (Handbook of Computational Economics) , 2006 .

[17]  Nicholas S. P. Tay,et al.  Fuzzy Inductive Reasoning, Expectation Formation and the Behavior of Security Prices , 2001 .

[18]  H. R. Berenji,et al.  Fuzzy Q-learning for generalization of reinforcement learning , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[19]  Robert E. Whaley,et al.  The Investor Fear Gauge , 2000 .

[20]  Perry J. Kaufman,et al.  Trading Systems and Methods , 1997 .

[21]  Shigenobu Kobayashi,et al.  An Analysis of Actor/Critic Algorithms Using Eligibility Traces: Reinforcement Learning with Imperfect Value Function , 1998, ICML.

[22]  Ramazan Gençay,et al.  Optimization of technical trading strategies and the profitability in security markets , 1998 .

[23]  Guojun Wu,et al.  Asymmetric Volatility and Risk in Equity Markets , 1997 .

[24]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[25]  H. Zimmermann,et al.  On the suitability of minimum and product operators for the intersection of fuzzy sets , 1979 .

[26]  J. Murphy Technical Analysis of the Futures Markets: A Comprehensive Guide to Trading Methods and Applications , 1986 .

[27]  Halbert White,et al.  On learning the derivatives of an unknown mapping with multilayer feedforward networks , 1992, Neural Networks.

[28]  Constantin von Altrock,et al.  Fuzzy Logic and NeuroFuzzy Applications in Business and Finance , 1996 .

[29]  C. Kirkpatrick,et al.  Technical Analysis: The Complete Resource for Financial Market Technicians , 2006 .

[30]  Adnan Shaout,et al.  Fuzzy logic modeling for performance appraisal systems a framework for empirical evaluation , 1998 .

[31]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[32]  Hecht-Nielsen Theory of the backpropagation neural network , 1989 .

[33]  S. Sosvilla‐Rivero,et al.  On the profitability of technical trading rules based on artificial neural networks:: Evidence from the Madrid stock market , 2000 .

[34]  Halbert White,et al.  Artificial neural networks: an econometric perspective ∗ , 1994 .

[35]  H. Ishibuchi,et al.  A learning algorithm of fuzzy neural networks with triangular fuzzy weights , 1995 .

[36]  P. Glorennec,et al.  Fuzzy Q-learning , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[37]  P. Giot Relationships Between Implied Volatility Indexes and Stock Index Returns , 2005 .

[38]  E. Fama,et al.  Filter Rules and Stock-Market Trading , 1966 .

[39]  Bart Kosko,et al.  Neural networks and fuzzy systems , 1998 .

[40]  Camillo Lento,et al.  A Combined Signal Approach to Technical Analysis on the S&P 500 , 2008 .

[41]  P. D. Lima,et al.  Nuisance parameter free properties of correlation integral based statistics , 1996 .

[42]  David P. Simon The Nasdaq Volatility Index During and After the Bubble , 2003 .

[43]  Michio Sugeno,et al.  Industrial Applications of Fuzzy Control , 1985 .

[44]  H. Simon,et al.  Models of Man. , 1957 .

[45]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[46]  Halbert White,et al.  Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.

[47]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[48]  E. Fama,et al.  Efficient Capital Markets : II , 2007 .

[49]  E. Fama,et al.  Size and Book-to-Market Factors in Earnings and Returns , 1995 .

[50]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[51]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[52]  P. Mohapatra,et al.  Forecasting and trading strategy for the foreign exchange market , 1993 .

[53]  Tony Plummer,et al.  Forecasting Financial Markets: The Psychology of Successful Investing , 1989 .

[54]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[55]  J. Hull Options, Futures, and Other Derivatives , 1989 .

[56]  P. Krugman,et al.  Trigger Strategies and Price Dynamics in Equity and Foreign Exchange Markets , 1987 .

[57]  Philippe Jorion Value at Risk , 2001 .

[58]  Philip Hans Franses,et al.  Non-Linear Time Series Models in Empirical Finance , 2000 .

[59]  Panos J. Antsaklis,et al.  An introduction to intelligent and autonomous control , 1993 .

[60]  Ramazan Gençay,et al.  The predictability of security returns with simple technical trading rules , 1998 .

[61]  Cheolbeom Park,et al.  What Do We Know About the Profitability of Technical Analysis? , 2007 .

[62]  A. Christie,et al.  The stochastic behavior of common stock variances: value , 1982 .

[63]  Meng Joo Er,et al.  Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning , 2004, IEEE Trans. Syst. Man Cybern. Part B.

[64]  Nikola Gradojevic,et al.  Non-linear, hybrid exchange rate modeling and trading profitability in the foreign exchange market , 2007 .

[65]  Marcus O'Connor,et al.  Exploring judgemental forecasting , 1992 .

[66]  Xiru Zhang Non-Linear Predictive Models for Intra-Day Foreign Exchange Trading , 1994 .

[67]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[68]  Fred Collopy,et al.  How effective are neural networks at forecasting and prediction? A review and evaluation , 1998 .

[69]  Francis X. Diebold,et al.  The Rodney L. White Center for Financial Research Financial Asset Returns, Direction-of-Change Forecasting and Volatility , 2003 .

[70]  Douglas Wood,et al.  The profitability of daily stock market indices trades based on neural network predictions: case study for the S&P 500, the DAX, the TOPIX and the FTSE in the period 1965–1999 , 2004 .

[71]  S. Sumathi,et al.  Applications of Fuzzy Logic , 2007 .

[72]  James D. Hamilton A New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle , 1989 .

[73]  E. H. Mamdani,et al.  Application of Fuzzy Logic to Approximate Reasoning Using Linguistic Synthesis , 1976, IEEE Transactions on Computers.

[74]  Vijaykumar Gullapalli,et al.  Reinforcement learning and its application to control , 1992 .

[75]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[76]  L. Glosten,et al.  On the Relation between the Expected Value and the Volatility of the Nominal Excess Return on Stocks , 1993 .

[77]  James D. Hamilton Time Series Analysis , 1994 .

[78]  C. K. Ma,et al.  Memories, heteroscedasticity, and price limit in Currency futures markets , 1992 .

[79]  S. Cummer,et al.  Adaptive Filter for Event-Based Signal Extraction , 2004 .

[80]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[81]  Andrei Shleifer,et al.  Good News for Value Stocks: Further Evidence on Market Efficiency , 1995 .

[82]  Cars H. Hommes,et al.  Financial markets as nonlinear adaptive evolutionary systems , 2001 .

[83]  W. Brock,et al.  Heterogeneous beliefs and routes to chaos in a simple asset pricing model , 1998 .

[84]  Lionel Jouffe,et al.  Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[85]  J. Buckley,et al.  Fuzzy neural networks: a survey , 1994 .

[86]  G. Schwert,et al.  Stock Volatility in the New Millennium: How Wacky is NASDAQ? , 2001 .

[87]  C. S. George Lee,et al.  Neural fuzzy systems: a neuro-fuzzy synergism to intelligent systems , 1996 .

[88]  Chung-Ming Kuan,et al.  Reexamining the Profitability of Technical Analysis with Data Snooping Checks , 2005 .

[89]  L. Summers,et al.  The Noise Trader Approach to Finance , 1990 .

[90]  Richard M. Levich,et al.  The Significance of Technical Trading-Rule Profits in the Foreign Exchange Market: a Bootstrap Approach , 1991 .

[91]  H. White,et al.  There exists a neural network that does not make avoidable mistakes , 1988, IEEE 1988 International Conference on Neural Networks.

[92]  Andrew W. Lo,et al.  Foundations of Technical Analysis: Computational Algorithms, Statistical Inference, and Empirical Implementation , 2000 .