Optimization of a Comprehensive Sequence Forecasting Framework Based on DAE-LSTM Algorithm

Sales forecasting is a multivariate time series forecasting problem. The main challenges of the forecasting task are the high-dimensional influence variables with noise and the complex time series relationships. In this paper, we propose a DAE-LSTM algorithm combined with denoising autoencoder(DAE) and long short-term memory(LSTM) to deal with this problem. The DAE is a non-linear dimensionality reduction method that enhances data robustness. The LSTM is a deep learning algorithm suitable for dealing with multivariate time series problem. Combining the advantages of the above two algorithms, DAE-LSTM is a new multivariate time series prediction algorithm, which transforms unsupervised DAE into supervised data feature extraction method, and adds feature selection function to LSTM algorithm. In addition, in order to build a comprehensive forecasting framework, the hyperparameters of the model are determined by Bayesian optimization method(BOM) and the parameters of the model are solved by adaptive moment estimation(ADAM) optimization algorithm. In the training process, due to the adaptability of the BOM and ADAM, the forecasting model has few parameters that need to be set artificially, only including dimensional scale and time steps determined by data. Finally, the proposed forecasting framework is applied to forecast the sales of online retailers on JD.COM. In comparison with other machine learning algorithms, including multiple linear regression(MLR), support vector regression(SVR), artificial neural network(ANN) and LSTM, the proposed DAE-LSTM achieves the best prediction accuracy.

[1]  Rankothge Gishan Hiranya Pemathilake,et al.  Sales Forecasting Based on AutoRegressive Integrated Moving Average and Recurrent Neural Network Hybrid Model , 2018, 2018 14th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[2]  Yuquan Qin,et al.  Sales forecast based on BP neural network , 2011, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[3]  Shahaboddin Shamshirband,et al.  RETRACTED ARTICLE: Support vector regression methodology for prediction of input displacement of adaptive compliant robotic gripper , 2014, Applied Intelligence.

[4]  Alex Graves,et al.  Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[5]  Shahaboddin Shamshirband,et al.  Retraction Note to: Support vector regression methodology for prediction of input displacement of adaptive compliant robotic gripper , 2018, Applied Intelligence.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[8]  Hongliang Zhu,et al.  A Universal High-Performance Correlation Analysis Detection Model and Algorithm for Network Intrusion Detection System , 2017 .

[9]  Anila Cyril,et al.  Modelling and Forecasting Bus Passenger Demand using Time Series Method , 2018, 2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO).

[10]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[11]  Long Yu,et al.  Bidirectional LSTM Malicious webpages detection algorithm based on convolutional neural network and independent recurrent neural network , 2019, Applied Intelligence.

[12]  Jesús Alcalá-Fdez,et al.  Improving fuzzy logic controllers obtained by experts: a case study in HVAC systems , 2009, Applied Intelligence.

[13]  Zijun Zhang,et al.  Short-Term Electricity Price Forecasting With Stacked Denoising Autoencoders , 2017, IEEE Transactions on Power Systems.

[14]  Walter D. Potter,et al.  Comparison of individual and combined ANN models for prediction of air and dew point temperature , 2013, Applied Intelligence.

[15]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[16]  Zheng Hu,et al.  Taxonomy-aware collaborative denoising autoencoder for personalized recommendation , 2019, Applied Intelligence.

[17]  Nando de Freitas,et al.  A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.

[18]  Chi-Jie Lu,et al.  Sales forecasting of computer products based on variable selection scheme and support vector regression , 2014, Neurocomputing.

[19]  Zhi-Ping Fan,et al.  Product sales forecasting using online reviews and historical sales data: A method combining the Bass model and sentiment analysis , 2017 .

[20]  Francesco Contino,et al.  A hyperparameters selection technique for support vector regression models , 2017, Appl. Soft Comput..

[21]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[22]  Gopalakrishnan T,et al.  Prediction of Sales Value in Online shopping using Linear Regression , 2018, 2018 4th International Conference on Computing Communication and Automation (ICCCA).