Gaussian Weighting Reversion Strategy for Accurate Online Portfolio Selection

In this paper, we design and implement a new on-line portfolio selection strategy based on reversion mechanism and weighted on-line learning. Our strategy, called “Gaussian Weighting Reversion” (GWR), improves the reversion estimator to form optimal portfolios and effectively overcomes the shortcomings of existing on-line portfolio selection strategies. Firstly, GWR uses Gaussian function to weight data in a sliding window to exploit the “time validity” of historical market data. It means that the more recent data are more valuable for market prediction than the earlier. Secondly, the self-learning for various sliding windows is created to make our strategy adaptive to different markets. In addition, double estimations are first proposed to be made at each time point, and the average of double estimations is obtained to alleviate the influence of noise and outliers. Extensive evaluation on six public datasets shows the advantages of our strategy compared with other nine competing strategies, including the state-of-the-art ones. Finally, the complexity analysis of GWR shows its availability in large-scale real-life online trading.

[1]  Matthew Saffell,et al.  Learning to trade via direct reinforcement , 2001, IEEE Trans. Neural Networks.

[2]  Aurelio Tommasetti,et al.  Flatness-based adaptive fuzzy control of chaotic finance dynamics , 2017 .

[3]  Bin Li,et al.  Confidence Weighted Mean Reversion Strategy for Online Portfolio Selection , 2011, TKDD.

[4]  Yoram Singer,et al.  Efficient projections onto the l1-ball for learning in high dimensions , 2008, ICML '08.

[5]  Bin Li,et al.  Robust Median Reversion Strategy for Online Portfolio Selection , 2013, IEEE Transactions on Knowledge and Data Engineering.

[6]  Allan Borodin,et al.  Can We Learn to Beat the Best Stock , 2003, NIPS.

[7]  Zhengyao Jiang,et al.  A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem , 2017, ArXiv.

[8]  Cem Tekin,et al.  Online Learning in Limit Order Book Trade Execution , 2018, IEEE Transactions on Signal Processing.

[9]  Alberto Ferreira de Souza,et al.  Prediction-based portfolio optimization model using neural networks , 2009, Neurocomputing.

[10]  Yoram Singer,et al.  On‐Line Portfolio Selection Using Multiplicative Updates , 1998, ICML.

[11]  Irene Aldridge,et al.  High-frequency Trading High-frequency Trading Industry Strategy Project Engineering Leadership Program , 2022 .

[12]  Ramsey Michael Faragher,et al.  Understanding the Basis of the Kalman Filter Via a Simple and Intuitive Derivation [Lecture Notes] , 2012, IEEE Signal Processing Magazine.

[13]  Kai Huang,et al.  Gaussian Weighting Reversion Strategy for Accurate On-Line Portfolio Selection , 2017, 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI).

[14]  Yanran Li,et al.  Adversarial Deep Reinforcement Learning in Portfolio Management , 2018 .

[15]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 1985 .

[16]  Fabio Stella,et al.  Stochastic Nonstationary Optimization for Finding Universal Portfolios , 2000, Ann. Oper. Res..

[17]  Kamran Usmani An Investigation into the Use of Reinforcement Learning Techniques within the Algorithmic Trading Domain , 2015 .

[18]  Erik Ordentlich,et al.  On-line portfolio selection , 1996, COLT '96.

[19]  László Györfi,et al.  Nonparametric nearest neighbor based empirical portfolio selection strategies , 2008 .

[20]  Steven C. H. Hoi,et al.  PAMR: Passive aggressive mean reversion strategy for portfolio selection , 2012, Machine Learning.

[21]  G. Connor Active Portfolio Management: A Quantitative Approach to Providing Superior Returns and Controlling Risk , 2000 .

[22]  Robert E. Schapire,et al.  Algorithms for portfolio management based on the Newton method , 2006, ICML.

[23]  Joelle Pineau,et al.  The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach , 2018, J. Artif. Intell. Res..

[24]  R. C. Merton,et al.  On Estimating the Expected Return on the Market: An Exploratory Investigation , 1980 .

[25]  Adam Tauman Kalai,et al.  Universal Portfolios With and Without Transaction Costs , 2004, Machine Learning.

[26]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[27]  Jianbin Qiu,et al.  Adaptive Fuzzy Control for Nontriangular Structural Stochastic Switched Nonlinear Systems With Full State Constraints , 2019, IEEE Transactions on Fuzzy Systems.

[28]  Bin Li,et al.  Moving average reversion strategy for on-line portfolio selection , 2015, Artif. Intell..

[29]  Jan Hendrik Witte,et al.  Deep Learning for Finance: Deep Portfolios , 2016 .

[30]  Garud N. Iyengar Growth Optimal Investment with Transaction Costs , 1998 .

[31]  Steven C. H. Hoi,et al.  Online portfolio selection: A survey , 2012, CSUR.

[32]  Bin Li,et al.  CORN: Correlation-driven nonparametric learning approach for portfolio selection , 2011, TIST.

[33]  Rubén Saborido,et al.  Evolutionary multi-objective optimization algorithms for fuzzy portfolio selection , 2016, Appl. Soft Comput..

[34]  Bin Li,et al.  On-Line Portfolio Selection with Moving Average Reversion , 2012, ICML.

[35]  Youyong Kong,et al.  Deep Direct Reinforcement Learning for Financial Signal Representation and Trading , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Csaba Szepesvári,et al.  Improved Algorithms for Linear Stochastic Bandits , 2011, NIPS.

[37]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[38]  G. Lugosi,et al.  NONPARAMETRIC KERNEL‐BASED SEQUENTIAL INVESTMENT STRATEGIES , 2006 .

[39]  Bin Li,et al.  Semi-Universal Portfolios with Transaction Costs , 2015, IJCAI.

[40]  Daniel Pérez Palomar,et al.  Robust Optimization of Order Execution , 2015, IEEE Transactions on Signal Processing.

[41]  W. Sharpe The Sharpe Ratio , 1994 .

[42]  John L. Kelly,et al.  A new interpretation of information rate , 1956, IRE Trans. Inf. Theory.

[43]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[44]  Jianbin Qiu,et al.  Observer-Based Fuzzy Adaptive Event-Triggered Control for Pure-Feedback Nonlinear Systems With Prescribed Performance , 2019, IEEE Transactions on Fuzzy Systems.

[45]  T. Cover Universal Portfolios , 1996 .

[46]  António Rua,et al.  International comovement of stock market returns: a wavelet analysis , 2009 .