ProfARIMA: A profit-driven order identification algorithm for ARIMA models in sales forecasting

Abstract In forecasting, evolutionary algorithms are often linked to existing forecasting methods to optimize their input parameters. Traditionally, the fitness function of these search heuristics is based on an accuracy measure. In this paper, however, we combine forecasting accuracy with business expertise by defining a flexible and easily interpretable profit function for sales forecasting, which is based on the profit margin of a given product, the volume of its sales and the accuracy of the forecast. ProfARIMA is a new procedure that selects the lags of a Seasonal ARIMA model according to the profit of a model's forecasts by taking advantage of search heuristics. This procedure is tested on both publicly available datasets and a real-life application with datasets of The Coca-Cola Company in order to assess its performance, both in profit and accuracy. Three different evolutionary algorithms were implemented during this testing process, i.e. Genetic Algorithms, Particle Swarm Optimization and Simulated Annealing. The results indicate that ProfARIMA always performs at least equally to the Box–Jenkins methodology and often outperforms this traditional procedure. For The Coca-Cola Company, our new algorithm in combination with Genetic Algorithms even leads to a significantly larger profit for out-of-sample forecasts.

[1]  Alex H. B. Duffy,et al.  Extended revenue forecasting within a service industry , 2013 .

[2]  Sen Guo,et al.  A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm , 2013, Knowl. Based Syst..

[3]  Yu Cheng,et al.  Seasonal and trend time series forecasting based on a quasi-linear autoregressive model , 2014, Appl. Soft Comput..

[4]  R. Król,et al.  Evaluating state revenue forecasting under a flexible loss function , 2013 .

[5]  Miki Haseyama,et al.  An ARMA order selection method with fuzzy reasoning , 2001, Signal Process..

[6]  Mohammad Moradzadeh,et al.  A novel hybrid algorithm for electricity price and load forecasting in smart grids with demand-side management , 2016 .

[7]  Aslam Muhammad,et al.  An efficient model based on artificial bee colony optimization algorithm with Neural Networks for electric load forecasting , 2014, Neural Computing and Applications.

[8]  Peter Whittle,et al.  Hypothesis Testing in Time Series Analysis. , 1951 .

[9]  Wei-Chiang Hong,et al.  Hybridization of seasonal chaotic cloud simulated annealing algorithm in a SVR-based load forecasting model , 2015, Neurocomputing.

[10]  Z. Abo-Hammour,et al.  ARMA model order and parameter estimation using genetic algorithms , 2012 .

[11]  Duanfeng Han,et al.  Vessel traffic flow forecasting by RSVR with chaotic cloud simulated annealing genetic algorithm and KPCA , 2015, Neurocomputing.

[12]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[13]  Tao Xu,et al.  An integrated chaotic time series prediction model based on efficient extreme learning machine and differential evolution , 2015, Neural Computing and Applications.

[14]  Jih-Jeng Huang,et al.  Model identification of ARIMA family using genetic algorithms , 2005, Appl. Math. Comput..

[15]  H. Akaike A new look at the statistical model identification , 1974 .

[16]  Xinjie Yu,et al.  Introduction to evolutionary algorithms , 2010, The 40th International Conference on Computers & Indutrial Engineering.

[17]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[18]  Francesco Battaglia,et al.  Evolutionary Statistical Procedures: An Evolutionary Computation Approach to Statistical Procedures Designs and Applications , 2011 .

[19]  Thomas Stützle,et al.  F-Race and Iterated F-Race: An Overview , 2010, Experimental Methods for the Analysis of Optimization Algorithms.

[20]  C. Tsallis,et al.  Generalized simulated annealing , 1995, cond-mat/9501047.

[21]  Zuhairi Baharudin,et al.  Optimization of neural network architecture using genetic algorithm for load forecasting , 2014, 2014 5th International Conference on Intelligent and Advanced Systems (ICIAS).

[22]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[23]  Thomas Stützle,et al.  Improvement Strategies for the F-Race Algorithm: Sampling Design and Iterative Refinement , 2007, Hybrid Metaheuristics.

[24]  Sifeng Liu,et al.  Grey double exponential smoothing model and its application on pig price forecasting in China , 2016, Appl. Soft Comput..

[25]  Melda Akın,et al.  A novel approach to model selection in tourism demand modeling , 2015 .

[26]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[27]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[28]  Bertrand Neveu,et al.  A beginner's guide to tuning methods , 2014, Appl. Soft Comput..

[29]  Dongxiao Niu,et al.  Short-term wind speed forecasting using wavelet transform and support vector machines optimized by genetic algorithm , 2014 .

[30]  Ke Wang,et al.  A hybrid self-adaptive Particle Swarm Optimization–Genetic Algorithm–Radial Basis Function model for annual electricity demand prediction , 2015 .

[31]  Jui-Chung Hung,et al.  Applying a combined fuzzy systems and GARCH model to adaptively forecast stock market volatility , 2011, Appl. Soft Comput..

[32]  Bart Baesens,et al.  A Novel Profit Maximizing Metric for Measuring Classification Performance of Customer Churn Prediction Models , 2013, IEEE Transactions on Knowledge and Data Engineering.

[33]  A. E. Eiben,et al.  Evolutionary Algorithm Parameters and Methods to Tune Them , 2012, Autonomous Search.

[34]  Murat Kankal,et al.  Neural network approach with teaching–learning-based optimization for modeling and forecasting long-term electric energy demand in Turkey , 2017, Neural Computing and Applications.

[35]  R. E. Lee,et al.  Distribution-free multiple comparisons between successive treatments , 1995 .

[36]  Chris Birchenhall,et al.  Seasonality and the Order of Integration for Consumption , 2009 .

[37]  Carlo Gaetan,et al.  Subset ARMA Model Identification Using Genetic Algorithms , 2000 .

[38]  R. K. Agrawal,et al.  A combination of artificial neural network and random walk models for financial time series forecasting , 2013, Neural Computing and Applications.

[39]  Eneko Osaba,et al.  A Hybrid Method for Short-Term Traffic Congestion Forecasting Using Genetic Algorithms and Cross Entropy , 2016, IEEE Transactions on Intelligent Transportation Systems.

[40]  Mingzhe Liu,et al.  Evolving RBF neural networks for rainfall prediction using hybrid particle swarm optimization and genetic algorithm , 2015, Neurocomputing.

[41]  Kenneth B. Kahn How to Measure the Impact of a Forecast Error on an Enterprise , 2003 .

[42]  C. L. Jain How to Measure the Cost of a Forecast Error , 2004 .

[43]  Robert Fildes,et al.  Principles of Business Forecasting , 2012 .

[44]  Francesco Battaglia,et al.  Estimating threshold subset autoregressive moving-average models by genetic algorithms , 2004 .

[45]  Abdollah Kavousi-Fard,et al.  A hybrid method based on wavelet, ANN and ARIMA model for short-term load forecasting , 2014, J. Exp. Theor. Artif. Intell..

[46]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[47]  Jui-Chung Hung,et al.  A genetic algorithm approach to the spectral estimation of time series with noise and missed observations , 2008, Inf. Sci..

[48]  Timon C. Du,et al.  Implementing support vector regression with differential evolution to forecast motherboard shipments , 2014, Expert Syst. Appl..

[49]  Scott Kirkpatrick,et al.  Optimization by Simmulated Annealing , 1983, Sci..

[50]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[51]  B. Eswara Reddy,et al.  A moving-average filter based hybrid ARIMA-ANN model for forecasting time series data , 2014, Appl. Soft Comput..

[52]  Wei Du,et al.  A multiobjective optimization-based neural network model for short-term replenishment forecasting in fashion industry , 2015, Neurocomputing.

[53]  Wei-Yuan Lin A novel 3D fruit fly optimization algorithm and its applications in economics , 2015, Neural Computing and Applications.

[54]  Vera Chung,et al.  Forecasting wind power in the Mai Liao Wind Farm based on the multi-layer perceptron artificial neural network model with improved simplified swarm optimization , 2014 .

[55]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[56]  Ronald Hochreiter,et al.  Evolving accuracy: A genetic algorithm to improve election night forecasts , 2015, Appl. Soft Comput..

[57]  Wei-Chiang Hong,et al.  A novel approach based on the Gauss-vSVR with a new hybrid evolutionary algorithm and input vector decision method for port throughput forecasting , 2017, Neural Computing and Applications.

[58]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[59]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[60]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[61]  Tommaso Minerva,et al.  Building ARMA Models with Genetic Algorithms , 2001, EvoWorkshops.

[62]  Siti Sakira Kamaruddin,et al.  Enhanced artificial bee colony for training least squares support vector machines in commodity price forecasting , 2014, J. Comput. Sci..

[63]  Eduard Muljadi,et al.  A Short-Term and High-Resolution Distribution System Load Forecasting Approach Using Support Vector Regression With Hybrid Parameters Optimization , 2018, IEEE Transactions on Smart Grid.

[64]  Wei-Chiang Hong,et al.  Forecasting holiday daily tourist flow based on seasonal support vector regression with adaptive genetic algorithm , 2015, Appl. Soft Comput..

[65]  Pei-Chann Chang,et al.  A Takagi-Sugeno fuzzy model combined with a support vector regression for stock trading forecasting , 2016, Appl. Soft Comput..

[66]  W. Fuller,et al.  Distribution of the Estimators for Autoregressive Time Series with a Unit Root , 1979 .

[67]  P. Whittle,et al.  Hypothesis-Testing in Time Series Analysis. , 1952 .

[68]  Zbigniew Michalewicz,et al.  Particle Swarm Optimization for Single Objective Continuous Space Problems: A Review , 2017, Evolutionary Computation.