Paper information - A novel CNN-DDPG based AI-trader: Performance and roles in business operations

Abstract Artificial intelligence (AI) has become an established part of everyday life and is increasingly important in both financial markets and business operations. In this paper, we build a novel AI-trader based on a reinforcement learning (RL) framework. We adopt an actor-critic RL algorithm, Deep Deterministic Policy Gradient (DDPG), to find the optimal policy. The proposed DDPG uses two different convolutional neural networks (CNNs) as function approximators. On real stock-index futures data, the proposed AI-trader outperforms the other methods it is compared against. We further discuss the generalization of the proposed method and its implications for business operations.
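
The abstract does not give implementation details, so the following is only a minimal sketch of two ingredients of the standard DDPG algorithm, not the authors' code. It shows the temporal-difference target used to train the critic and the soft (Polyak) update of the target networks. The function names, the discount factor `gamma`, and the mixing rate `tau` are illustrative assumptions. In the paper the function approximators are CNNs; here plain numbers stand in for their outputs and parameters.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic target: y = r + gamma * Q'(s', mu'(s')).

    next_q is the target critic's value for the next state and the
    target actor's action there. For a terminal step the bootstrap
    term is dropped and the target is just the reward.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target."""
    return [tau * p + (1.0 - tau) * t
            for t, p in zip(target_params, online_params)]


# Toy usage with hypothetical numbers.
y = td_target(reward=1.0, next_q=2.0, done=False, gamma=0.9)
new_target = soft_update([0.0, 0.0], [1.0, 1.0], tau=0.1)
```

A small `tau` makes the target networks change slowly, which is the usual way DDPG keeps the critic's regression target stable during training.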
