Forecasting of sales by using fusion of machine learning techniques

Forecasting is an integral part of any organization for their decision-making process so that they can predict their targets and modify their strategy in order to improve their sales or productivity in the coming future. This paper evaluates and compares various machine learning models, namely, ARIMA, Auto Regressive Neural Network(ARNN), XGBoost, SVM, Hy-brid Models like Hybrid ARIMA-ARNN, Hybrid ARIMA-XGBoost, Hybrid ARIMA-SVM and STL Decomposition (using ARIMA, Snaive, XGBoost) to forecast sales of a drug store company called Rossmann. Training data set contains past sales and supplemental information about drug stores. Accuracy of these models is measured by metrics such as MAE and RMSE. Initially, linear model such as ARIMA has been applied to forecast sales. ARIMA was not able to capture nonlinear patterns precisely, hence nonlinear models such as Neural Network, XGBoost and SVM were used. Nonlinear models performed better than ARIMA and gave low RMSE. Then, to further optimize the performance, composite models were designed using hybrid technique and decomposition technique. Hybrid ARIMA-ARNN, Hybrid ARIMA-XGBoost, Hybrid ARIMA-SVM were used and all of them performed better than their respective individual models. Then, the composite model was designed using STL Decomposition where the decomposed components namely seasonal, trend and remainder components were forecasted by Snaive, ARIMA and XGBoost. STL gave better results than individual and hybrid models. This paper evaluates and analyzes why composite models give better results than an individual model and state that decomposition technique is better than the hybrid technique for this application.

[1]  Ping Guo,et al.  A Novel Hybridization of Artificial Neural Networks and ARIMA Models for Forecasting Resource Consumption in an IIS Web Server , 2014, 2014 IEEE International Symposium on Software Reliability Engineering Workshops.

[2]  Muhammad Hisyam Lee,et al.  Two-level seasonal model based on hybrid ARIMA-ANFIS for forecasting short-term electricity load in Indonesia , 2012, 2012 International Conference on Statistics in Science, Business and Engineering (ICSSBE).

[3]  J. Contreras,et al.  ARIMA Models to Predict Next-Day Electricity Prices , 2002, IEEE Power Engineering Review.

[4]  Jiwei Liu,et al.  Connecting Devices to Cookies via Filtering, Feature Engineering, and Boosting , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[5]  Akhter Mohiuddin Rather,et al.  A prediction based approach for stock returns using autoregressive neural networks , 2011, 2011 World Congress on Information and Communication Technologies.

[6]  David S. Ebert,et al.  Spatiotemporal social media analytics for abnormal event detection and examination using seasonal-trend decomposition , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[7]  Jianguo Wei,et al.  A hybrid statistical approach for stock market forecasting based on Artificial Neural Network and ARIMA time series models , 2015, 2015 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC).

[8]  Chikkannan Eswaran,et al.  A Comparison of ARIMA, Neural Network and Linear Regression Models for the Prediction of Infant Mortality Rate , 2010, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation.

[9]  Thakur Raj Anand,et al.  Machine Learning Approach to Identify Users Across Their Digital Devices , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[10]  Ponnuthurai N. Suganthan,et al.  A hybrid ARIMA-DENFIS method for wind speed forecasting , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[11]  Jung-Hua Lo,et al.  A study of applying ARIMA and SVM model to software reliability prediction , 2011, 2011 International Conference on Uncertainty Reasoning and Knowledge Engineering.

[12]  Magdy M. A. Salama,et al.  Application of the decomposition technique for forecasting the load of a large electric power network , 1996 .

[13]  Wei Sun,et al.  Application of Time Series Based SVM Model on Next-Day Electricity Price Forecasting Under Deregulated Power Market , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[14]  Yujun He,et al.  Research on Hybrid ARIMA and Support Vector Machine Model in Short Term Load Forecasting , 2006, Sixth International Conference on Intelligent Systems Design and Applications.

[15]  T. Senjyu,et al.  Notice of Violation of IEEE Publication PrinciplesA Hybrid ARIMA and Neural Network Model for Short-Term Price Forecasting in Deregulated Market , 2010, IEEE Transactions on Power Systems.

[16]  Robert Skopal Short-term hourly price forward curve prediction using neural network and hybrid ARIMA-NN model , 2015, 2015 International Conference on Information and Digital Technologies.

[17]  Zijun Zhang,et al.  Forecasting the electricity price in iran power market: A comparison between neural networks and time series methods , 2014, 2014 IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC).

[18]  Junzhong Gu,et al.  Comparative study among three different artificial neural networks to infectious diarrhea forecasting , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[19]  Ravi Sankar,et al.  Time Series Prediction Using Support Vector Machines: A Survey , 2009, IEEE Computational Intelligence Magazine.

[20]  Daoping Chen,et al.  Chinese automobile demand prediction based on ARIMA model , 2011, 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI).