A Multi-Scale Temporal Feature Aggregation Convolutional Neural Network for Portfolio Management

Financial portfolio management is the process of periodically reallocating a fund into different financial investment products, with the goal of achieving the maximum profits. While conventional financial machine learning methods try to predict the price trends, reinforcement learning based portfolio management methods makes trading decisions according to the price changes directly. However, existing reinforcement learning based methods are limited in extracting the price change information at single-scale level, which makes their performance still not satisfactory. In this paper, inspired by the Inception network that has achieved great success in computer vision and can extract multi-scale features simultaneously, we propose a novel Ensemble of Identical Independent Inception (EI$^3$) convolutional neural network, with the objective of addressing the limitation of existing reinforcement learning based portfolio management methods. With EI$^3$, multiple assets can be processed independently while sharing the same network parameters. Moreover, price movement information for each product can be extracted at multiple scales via wide network and then aggregated to make trading decision. Based on EI$^3$, we further propose a recurrent reinforcement learning framework to provide a deep machine learning solution for the portfolio management problem. Comprehensive experiments on the cryptocurrency datasets demonstrate the superiority of our method over existing competitors, in both upswing and downswing environments.

[1]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Bin Li,et al.  CORN: Correlation-driven nonparametric learning approach for portfolio selection , 2011, TIST.

[4]  Bin Li,et al.  Robust Median Reversion Strategy for Online Portfolio Selection , 2013, IEEE Transactions on Knowledge and Data Engineering.

[5]  Allan Borodin,et al.  Can We Learn to Beat the Best Stock , 2003, NIPS.

[6]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[7]  Tony Guida,et al.  Big Data and Machine Learning in Quantitative Investment , 2018 .

[8]  Yoram Singer,et al.  On‐Line Portfolio Selection Using Multiplicative Updates , 1998, ICML.

[9]  G. Lugosi,et al.  NONPARAMETRIC KERNEL‐BASED SEQUENTIAL INVESTMENT STRATEGIES , 2006 .

[10]  Gerald Appel,et al.  Technical Analysis: Power Tools for Active Investors , 2005 .

[11]  Stelios D. Bekiros,et al.  Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach , 2010 .

[12]  J. Moody,et al.  Performance functions and reinforcement learning for trading systems and portfolios , 1998 .

[13]  Weiguo Zhang,et al.  Weighted Moving Average Passive Aggressive Algorithm for Online Portfolio Selection , 2013, 2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics.

[14]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[15]  Steven C. H. Hoi,et al.  Online portfolio selection: A survey , 2012, CSUR.

[16]  Alexandros Iosifidis,et al.  Temporal Attention-Augmented Bilinear Network for Financial Time-Series Data Analysis , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[18]  Youyong Kong,et al.  Deep Direct Reinforcement Learning for Financial Signal Representation and Trading , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[20]  Robert E. Schapire,et al.  Algorithms for portfolio management based on the Newton method , 2006, ICML.

[21]  G. Ritter Reinforcement Learning in Finance , 2018, Big Data and Machine Learning in Quantitative Investment.

[22]  Aderemi Oluyinka Adewumi,et al.  Stock Price Prediction Using the ARIMA Model , 2014, 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation.

[23]  Thomas G. Fischer,et al.  Reinforcement learning in financial markets - a survey , 2018 .

[24]  Zhengyao Jiang,et al.  Cryptocurrency portfolio management with deep reinforcement learning , 2016, 2017 Intelligent Systems Conference (IntelliSys).

[25]  Steven C. H. Hoi,et al.  PAMR: Passive aggressive mean reversion strategy for portfolio selection , 2012, Machine Learning.

[26]  Allan Borodin,et al.  On the Competitive Theory and Practice of Portfolio Selection (Extended Abstract) , 2000, LATIN.

[27]  Marco Corazza,et al.  Q-Learning-Based Financial Trading Systems with Applications , 2014 .

[28]  Steve Y. Yang,et al.  An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown , 2017, Expert Syst. Appl..

[29]  Bin Li,et al.  On-Line Portfolio Selection with Moving Average Reversion , 2012, ICML.

[30]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[31]  W. Sharpe The Sharpe Ratio , 1994 .

[32]  Zhengyao Jiang,et al.  A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem , 2017, ArXiv.

[33]  T. Cover Universal Portfolios , 1996 .

[34]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Amir F. Atiya,et al.  The maximum drawdown of the Brownian motion , 2003, 2003 IEEE International Conference on Computational Intelligence for Financial Engineering, 2003. Proceedings..

[36]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).