Knowledge discovery for time series

My thesis investigates the use of machine learning methods for analysis of economic and financial time series. Since structural models in economics and finance are known to have limited predictive power, I study a data driven, time series approach to knowledge discovery in these domains. The ultimate goal of building predictive models of such time series is to support decision making in areas such as business, investing, and government policy. Machine learning offers powerful tools for forecasting and decision making. Supervised learning methods can be used to develop forecasting models of economic series that can aid in decision support. Reinforcement learning methods can produce systems capable of making investment decisions. Hence, my thesis consists of two main investigations: a study of methods for predicting macroeconomic and financial time series, and a study of extensions to a reinforcement learning algorithm for constructing financial decision systems. In the forecasting project, I develop a supervised training methodology for models that predict challenging macroeconomic and financial time series. I compare the performance of linear and nonlinear networks with a diverse set of standard linear benchmark models. While some advantage is obtained from the use of nonlinear networks for certain of these time series, a key result is that linear network models trained with stochastic, nonlinear neural network learning algorithms can achieve greatly improved performance over the benchmark methods on most of the data sets. The second topic investigated is enhancements to the Recurrent Reinforcement Learning (RRL) algorithm. The RRL approach to trading system design has been shown to be effective at learning strategies that directly maximize financial objective functions, and also has been shown to outperform approaches based on supervised learning on artificial data sets. In my work, I investigate several significant extensions of RRL: to incorporate downside risk measures, to compare the RRL policy approach to an alternate RL value function approach, to extend the approach to portfolio management, and to conduct simulation studies on a number of artificial and real data sets, including an S&P-500 asset allocation system and a high frequency foreign exchange trader.

[1]  M. Newman,et al.  A Biographical Note. , 2021, Science.

[2]  W. A. Clark,et al.  Simulation of self-organizing systems by digital computer , 1954, Trans. IRE Prof. Group Inf. Theory.

[3]  B. G. Farley,et al.  Generalization of pattern recognition in a self-organizing system , 1955, AFIPS '55 (Western).

[4]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[5]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[6]  A. L. Samuel,et al.  Some studies in machine learning using the game of checkers. II: recent progress , 1967 .

[7]  J. M. Bates,et al.  The Combination of Forecasts , 1969 .

[8]  R. C. Merton,et al.  Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case , 1969 .

[9]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[10]  E. Elton,et al.  DYNAMIC PROGRAMMING APPLICATIONS IN FINANCE , 1971 .

[11]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[12]  A. Zellner,et al.  Time series analysis and simultaneous equation econometric models , 1974 .

[13]  R. C. Merton,et al.  Optimum Consumption and Portfolio Rules in a Continuous-Time Model* , 1975 .

[14]  S. Ross,et al.  Option pricing: A simplified approach☆ , 1979 .

[15]  Stephen Taylor,et al.  Forecasting Economic Time Series , 1979 .

[16]  Robert B. Litterman,et al.  Forecasting and Conditional Projection Using Realistic Prior Distributions , 1983 .

[17]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[18]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[19]  Robert B. Litterman Forecasting with Bayesian Vector Autoregressions-Five Years of Experience , 1984 .

[20]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[21]  Stephen K. McNees Forecasting Accuracy of Alternative Techniques: A Comparison of U.S. Macroeconomic Forecasts , 1986 .

[22]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986 .

[23]  C. Nelson The Investor's Guide to Economic Indicators , 1987 .

[24]  Michael C. Mozer,et al.  Skeletonization: A Technique for Trimming the Fat from a Network via Relevance Assessment , 1988, NIPS.

[25]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[26]  B. Yandell Spline smoothing and nonparametric regression , 1989 .

[27]  Douglas T. Breeden Intertemporal Portfolio Theory and Asset Pricing , 1989 .

[28]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[29]  Yann LeCun,et al.  Optimal Brain Damage , 1989, NIPS.

[30]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[31]  J. Plocek Economic Indicators: How America Reads Its Financial Health , 1990 .

[32]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[33]  P. Samuelson Asset allocation could be dangerous to your health , 1990 .

[34]  G. Wahba Spline models for observational data , 1990 .

[35]  Glenn D. Rudebusch,et al.  Forecasting Output with the Composite Leading Index: A Real-Time Analysis , 1991 .

[36]  Robert B. Litterman,et al.  Asset Allocation , 1991 .

[37]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[38]  John Moody,et al.  Note on generalization, regularization and architecture selection in nonlinear learning systems , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[39]  John E. Moody,et al.  The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems , 1991, NIPS.

[40]  David N. Nawrocki Optimal algorithms and lower partial moment: ex post results , 1991 .

[41]  F. Sortino,et al.  DOWNSIDE RISK - CAPTURING WHATS AT STAKE IN INVESTMENT SITUATIONS , 1991 .

[42]  David N. Nawrocki The characteristics of portfolios selected by n-degree Lower Partial Moment , 1992 .

[43]  Richard Stutely The Economist Guide to Economic Indicators: Making Sense of Economics , 1992 .

[44]  Babak Hassibi,et al.  Second Order Derivatives for Network Pruning: Optimal Brain Surgeon , 1992, NIPS.

[45]  C. Atkeson,et al.  Prioritized Sweeping : Reinforcement Learning withLess Data and Less Real , 1993 .

[46]  John E. Moody,et al.  Fast Pruning Using Principal Components , 1993, NIPS.

[47]  C. Granger,et al.  Modelling Nonlinear Economic Relationships , 1995 .

[48]  Jing Peng,et al.  Efficient Learning and Planning Within the Dyna Framework , 1993, Adapt. Behav..

[49]  John Moody,et al.  Predicting the U.S. Index of Industrial Production (Extended Abstract) , 1993 .

[50]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[51]  W. Enders Applied Econometric Time Series , 1994 .

[52]  Gerald Tesauro,et al.  TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.

[53]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[54]  Carsten Peterson,et al.  Finding the Embedding Dimension and Variable Dependencies in Time Series , 1994, Neural Computation.

[55]  Gerald Tesauro,et al.  Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..

[56]  John Moody,et al.  Architecture Selection Strategies for Neural Networks: Application to Corporate Bond Rating Predicti , 1995, NIPS 1995.

[57]  J. Utans,et al.  Input variable selection for neural networks: application to predicting the U.S. business cycle , 1995, Proceedings of 1995 Conference on Computational Intelligence for Financial Engineering (CIFEr).

[58]  J. Moody Economic forecasting : challenges and neural network solutions , 1995 .

[59]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[60]  S. Satchell,et al.  An assessment of the economic value of non‐linear foreign exchange rate forecasts , 1995 .

[61]  A. Timmermann,et al.  Predictability of Stock Returns: Robustness and Economic Significance , 1995 .

[62]  Thomas G. Dietterich,et al.  High-Performance Job-Shop Scheduling With A Time-Delay TD(λ) Network , 1995, NIPS 1995.

[63]  Andrew G. Barto,et al.  Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.

[64]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[65]  Ralph Neuneier,et al.  Optimal Asset Allocation using Adaptive Dynamic Programming , 1995, NIPS.

[66]  Robert A. Jarrow,et al.  Modelling Fixed Income Securities and Interest Rate Options , 1995 .

[67]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[68]  John E. Moody,et al.  Forecasting the Economy with Neural Nets: A Survey of Challenges and Solutions , 1996, Neural Networks: Tricks of the Trade.

[69]  F. Sortino,et al.  On the Use and Misuse of Downside Risk , 1996 .

[70]  Richard M. Golden,et al.  Mathematical Methods for Neural Network Analysis and Design , 1996 .

[71]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[72]  Martin Natter,et al.  Adaptive Methods in Macroeconomic Forecasting , 1997, Intell. Syst. Account. Finance Manag..

[73]  Richard D. Braatz,et al.  On the "Identification and control of dynamical systems using neural networks" , 1997, IEEE Trans. Neural Networks.

[74]  F. Diebold,et al.  The Past, Present, and Future of Macroeconomic Forecasting , 2020, Business Cycles.

[75]  Lizhong Wu,et al.  Optimization of trading systems and portfolios , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[76]  Norman R. Swanson,et al.  A Model Selection Approach to Real-Time Macroeconomic Forecasting Using Linear Models and Artificial Neural Networks , 1997, Review of Economics and Statistics.

[77]  John Moody,et al.  Reinforcement Learning for Trading Systems and Portfolios: Immediate vs Future Rewards , 1998 .

[78]  J. Moody,et al.  Performance functions and reinforcement learning for trading systems and portfolios , 1998 .

[79]  N. C. R. O B E R T S O N A N D E L L I,et al.  Data Vintages and Measuring Forecast Model Performance , 1998 .

[80]  Philip Hans Franses,et al.  Time Series Models for Business and Economic Forecasting , 1998 .

[81]  Matthew Saffell,et al.  Reinforcement Learning for Trading , 1998, NIPS.

[82]  Andrew W. Moore,et al.  Gradient Descent for General Reinforcement Learning , 1998, NIPS.

[83]  John N. Tsitsiklis,et al.  Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives , 1999, IEEE Trans. Autom. Control..

[84]  John N. Tsitsiklis,et al.  Actor-Critic Algorithms , 1999, NIPS.

[85]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[86]  Norman R. Swanson,et al.  Nonlinear Econometric Modelling: A Selective Review , 1999 .

[87]  P. Bartlett,et al.  Direct Gradient-Based Reinforcement Learning: I. Gradient Estimation Algorithms , 1999 .

[88]  David N. Nawrocki A Brief History of Downside Risk Measures , 1999 .

[89]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[90]  L. Ungar,et al.  Localizing Policy Gradient Estimates to Action Transitions , 2000 .

[91]  P. Franses The Econometric Modelling of Financial Time Series: Second Edition, Terence C. Mills, (Cambridge: Cambridge University Press, 1999) 380 pages, Paperback; ISBN 0521-62492-4 ($27.95). Hardback: ISBN 0521-62413-4 ($80.00) , 2000 .

[92]  John N. Tsitsiklis,et al.  Regression methods for pricing complex American-style options , 2001, IEEE Trans. Neural Networks.

[93]  Francis A. Longstaff,et al.  Valuing American Options by Simulation: A Simple Least-Squares Approach , 2001 .

[94]  John N. Tsitsiklis,et al.  Simulation-based optimization of Markov reward processes , 2001, IEEE Trans. Autom. Control..

[95]  Matthew Saffell,et al.  Learning to trade via direct reinforcement , 2001, IEEE Trans. Neural Networks.

[96]  Robert Buff Continuous Time Finance , 2002 .

[97]  Dana Ron,et al.  An Experimental and Theoretical Comparison of Model Selection Methods , 1995, COLT '95.

[98]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[99]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[100]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[101]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[102]  A. E. Hoerl,et al.  American Society for Quality Ridge Regression : Biased Estimation for Nonorthogonal Problems , 2007 .

[103]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[104]  Jack L. Treynor,et al.  MUTUAL FUND PERFORMANCE* , 2007 .

[105]  Yoshua Bengio,et al.  TRAINING A NEURAL NETWORK WITH A FINANCIAL CRITERION RATHER THAN A PREDICTION CRITERION , 2007 .