Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail

This paper proposes a novel forecasting method that combines the deep learning method – long short-term memory (LSTM) networks and random forest (RF). The proposed method can model complex relationships of both temporal and regression type which gives it an edge in accuracy over other forecasting methods. We evaluated the new method on a real-world multivariate dataset from a multi-channel retailer. We benchmark the forecasting performance of the new proposition against neural networks, multiple regression, ARIMAX, LSTM networks, and RF. We employed forecasting performance metrics to measure bias, accuracy, and variance, and the empirical evidence suggests that the new proposition is (statistically) significantly better. Furthermore, our method ranks the explanatory variables in terms of their relative importance. The empirical evaluations are replicated for longer forecasting horizons, and online and offline channels and the same conclusions hold; thus, advocating for the robustness of our forecasting proposition as well as the suitability in multi-channel retail demand forecasting.

[1]  H. V. Dijk,et al.  Combined forecasts from linear and nonlinear time series models , 1999 .

[2]  Andrew Kusiak,et al.  From data to big data in production research: the past and future trends , 2019, Int. J. Prod. Res..

[3]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[4]  Zoubin Ghahramani,et al.  A Theoretically Grounded Application of Dropout in Recurrent Neural Networks , 2015, NIPS.

[5]  L. Campos,et al.  Combination of forecasts for the price of crude oil on the spot market , 2016 .

[6]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[7]  Evgeny A. Antipov,et al.  Mass Appraisal of Residential Apartments: An Application of Random Forest for Valuation and a CART-Based Approach for Model Diagnostics , 2010, Expert Syst. Appl..

[8]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[9]  E. H. Grosse,et al.  Logistics 4.0: a systematic review towards a new logistics system , 2019, Int. J. Prod. Res..

[10]  Haiyan Lu,et al.  A case study on a hybrid wind speed forecasting method using BP neural network , 2011, Knowl. Based Syst..

[11]  Del Moral HernandezEmilio 2005 Special Issue , 2005 .

[12]  Alexander H. Hübner,et al.  Retail logistics in the transition from multi-channel to omni-channel , 2016 .

[13]  Fotios Petropoulos,et al.  forecast: Forecasting functions for time series and linear models , 2018 .

[14]  S. Raj,et al.  Reference Price Research: Review and Propositions , 2005 .

[15]  Manoj Kumar Tiwari,et al.  Demand prediction and price optimization for semi-luxury supermarket segment , 2017, Comput. Ind. Eng..

[16]  A. Potter,et al.  Social media in operations and supply chain management: State-of-the-Art and research directions , 2020, Int. J. Prod. Res..

[17]  Ping-Feng Pai,et al.  A hybrid ARIMA and support vector machines model in stock price forecasting , 2005 .

[18]  Yossi Aviv,et al.  On the Benefits of Collaborative Forecasting Partnerships Between Retailers and Manufacturers , 2007, Manag. Sci..

[19]  Min Guo,et al.  Iot based laundry services: an application of big data analytics, intelligent logistics management, and machine learning techniques , 2020, Int. J. Prod. Res..

[20]  SchmidhuberJürgen,et al.  2005 Special Issue , 2005 .

[21]  Takahiro Watanabe,et al.  Document Analysis and Recognition , 1999, Communications in Computer and Information Science.

[22]  M. Fitzgerald,et al.  Horses for courses. , 2004, International journal of nursing practice.

[23]  John E. Boylan,et al.  State-space ARIMA for supply-chain forecasting , 2019, Int. J. Prod. Res..

[24]  Nikhil Ketkar,et al.  Deep Learning with Python , 2017 .

[25]  Kenneth Gilbert,et al.  An ARIMA Supply Chain Model , 2005, Manag. Sci..

[26]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[27]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[28]  Qi Wu,et al.  A hybrid-forecasting model based on Gaussian support vector machine and chaotic particle swarm optimization , 2010, Expert Syst. Appl..

[29]  S. Minner,et al.  A review on supply chain contracting with information considerations: information updating and information asymmetry , 2019, Int. J. Prod. Res..

[30]  Carlo Meloni,et al.  A reliable decision support system for fresh food supply chain management , 2018, Int. J. Prod. Res..

[31]  Kut C. So,et al.  Impact of an "online-to-store" channel on demand allocation, pricing and profitability , 2016, Eur. J. Oper. Res..

[32]  Mark E. Ferguson,et al.  PRODUCTION AND OPERATIONS MANAGEMENT , 2008 .

[33]  M. Hashem Pesaran,et al.  A Simple Nonparametric Test of Predictive Performance , 1992 .

[34]  Jesús A. Rodríguez-Sarasty,et al.  Analogue-based demand forecasting of short life-cycle products: a regression approach and a comprehensive assessment , 2017, Int. J. Prod. Res..

[35]  Sébastien Thomassey,et al.  Sales forecasts in clothing industry: The key success factor of the supply chain management , 2010 .

[36]  Ming Liu,et al.  Service-oriented bi-objective robust collection-disassembly problem with equipment selection , 2020, Int. J. Prod. Res..

[37]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[38]  Max Kuhn,et al.  The caret Package , 2007 .

[39]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[40]  Mario Cools,et al.  The use of time series forecasting in zone order picking systems to predict order pickers’ workload , 2017, Int. J. Prod. Res..

[41]  Borja Ponte,et al.  Applying machine learning to the dynamic selection of replenishment policies in fast-changing supply chain environments , 2018, Int. J. Prod. Res..

[42]  Marwan Khammash,et al.  Forecasting branded and generic pharmaceuticals , 2016 .

[43]  David D. Cox,et al.  Hyperopt: A Python Library for Optimizing the Hyperparameters of Machine Learning Algorithms , 2013, SciPy.

[44]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[45]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[46]  Tugba Taskaya-Temizel,et al.  2005 Special Issue: A comparative study of autoregressive neural network hybrids , 2005 .

[47]  Yoshua Bengio,et al.  Algorithms for Hyper-Parameter Optimization , 2011, NIPS.

[48]  Stefan Fritsch,et al.  neuralnet: Training of Neural Networks , 2010, R J..

[49]  Juuso Liesiö,et al.  Forecasting replenishment orders in retail: value of modelling low and intermittent consumer demand with distributions , 2018, Int. J. Prod. Res..

[50]  Daria Dzyabura,et al.  Offline Assortment Optimization in the Presence of an Online Channel , 2018, Manag. Sci..

[51]  M. Qi,et al.  Forecasting Aggregate Retail Sales: a Comparison of Arti"cial Neural Networks and Traditional Methods , 2001 .

[52]  Mehdi Khashei,et al.  A novel hybridization of artificial neural networks and ARIMA models for time series forecasting , 2011, Appl. Soft Comput..

[53]  Sandra Alemany,et al.  An ensemble of ordered logistic regression and random forest for child garment size matching , 2016, Comput. Ind. Eng..

[54]  Fotios Petropoulos,et al.  'Horses for Courses' in demand forecasting , 2014, Eur. J. Oper. Res..

[55]  George Sugihara,et al.  Nonlinear forecasting as a way of distinguishing chaos from measurement error in time series , 1990, Nature.

[56]  Haralambos Sarimveis,et al.  A combined model predictive control and time series forecasting framework for production-inventory systems , 2008 .

[57]  K. Nikolopoulos,et al.  The theta model: a decomposition approach to forecasting , 2000 .

[58]  Steven J. Erlebacher OPTIMAL AND HEURISTIC SOLUTIONS FOR THE MULTI‐ITEM NEWSVENDOR PROBLEM WITH A SINGLE CAPACITY CONSTRAINT , 2000 .

[59]  David Simchi-Levi,et al.  Analytics for an Online Retailer: Demand Forecasting and Price Optimization , 2016, Manuf. Serv. Oper. Manag..