论文信息 - A Deep Reinforcement Learning Approach for Automated Cryptocurrency Trading

A Deep Reinforcement Learning Approach for Automated Cryptocurrency Trading

Nowadays, Artificial Intelligence (AI) is changing our daily life in many application fields. Automatic trading has inspired a large number of field experts and scientists in developing innovative techniques and deploying cutting-edge technologies to trade different markets. In this context, cryptocurrency has given new interest in the application of AI techniques for predicting the future price of a financial asset. In this work Deep Reinforcement Learning is applied to trade bitcoin. More precisely, Double and Dueling Double Deep Q-learning Networks are compared over a period of almost four years. Two reward functions are also tested: Sharpe ratio and profit reward functions. The Double Deep Q-learning trading system based on Sharpe ratio reward function demonstrated to be the most profitable approach for trading bitcoin.

Giorgio Lucarelli | Matteo Borrotti | G. Lucarelli | M. Borrotti | Giorgio Lucarelli

[1] Zhengyao Jiang,et al. Cryptocurrency portfolio management with deep reinforcement learning , 2016, 2017 Intelligent Systems Conference (IntelliSys).

[2] Matthew Saffell,et al. Learning to trade via direct reinforcement , 2001, IEEE Trans. Neural Networks.

[3] Sung-Bae Cho,et al. Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency , 2018, IDEAL.

[4] Feifei Li,et al. DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning , 2017, CCS.

[5] Peter Henderson,et al. An Introduction to Deep Reinforcement Learning , 2018, Found. Trends Mach. Learn..

[6] Stuart E. Dreyfus,et al. Applied Dynamic Programming , 1965 .

[7] Andrea Baronchelli,et al. Machine Learning the Cryptocurrency Market , 2018, Complex..

[8] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[9] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[10] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[11] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[12] Steve Y. Yang,et al. An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown , 2017, Expert Syst. Appl..

[13] W. Sharpe. The Sharpe Ratio , 1994 .

[14] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[15] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16] Simon Caton,et al. Predicting the Price of Bitcoin Using Machine Learning , 2018, 2018 26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP).

[17] Yagna Patel,et al. Optimizing Market Making using Multi-Agent Reinforcement Learning , 2018, ArXiv.

[18] Yuxi Li,et al. Deep Reinforcement Learning: An Overview , 2017, ArXiv.