A study on the medium-term forecasting using exogenous variable selection of the extra-virgin olive oil with soft computing methods

Time series forecasting is an important task in the business sector. Agents in the olive oil sector consider medium-term price predictions more important than short-term ones. In collaboration with these agents, the aim of this work was set as forecasting the price of extra-virgin olive oil six months ahead. According to expert opinion, exogenous variables and technical indicators can help in this task and must be included in the forecasting process. The number of candidate variables makes feature selection algorithms necessary, both to reduce the number of variables and to increase the interpretability and usefulness of the resulting forecasting system. Thus, in this paper CO2RBFN, a cooperative-competitive algorithm for Radial Basis Function Network design, and other soft computing methods are applied to the data sets with the whole set of input variables and to the data sets with the selected set of input variables. The experiments show that CO2RBFN obtains the best results in medium-term forecasting of olive oil prices, with both the whole and the selected set of input variables. Moreover, the feature selection methods highlighted some influential variables that could be considered not only for prediction but also for describing the complex process involved in medium-term forecasting of the olive oil price.
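To make the setup concrete, the following is a minimal sketch of how a six-months-ahead forecasting data set with exogenous inputs might be assembled and then reduced with a simple filter-style feature selection. It is not the CO2RBFN algorithm and not the feature selection methods used in the paper: the function names `make_horizon_dataset` and `select_by_correlation`, the choice of three price lags, the use of only the current exogenous values, and the correlation-based ranking are all assumptions made for illustration.

```python
import numpy as np


def make_horizon_dataset(price, exog, n_lags=3, horizon=6):
    """Build (X, y) for direct multi-step forecasting.

    Each row of X holds the last `n_lags` prices up to time t followed by
    the exogenous values at time t; y is the price `horizon` steps later.
    (Illustrative layout, assumed here; not taken from the paper.)
    """
    price = np.asarray(price, dtype=float)
    exog = np.asarray(exog, dtype=float)
    rows, targets = [], []
    # t must have n_lags past prices available and a target at t + horizon.
    for t in range(n_lags - 1, len(price) - horizon):
        lagged = price[t - n_lags + 1 : t + 1]
        rows.append(np.concatenate([lagged, exog[t]]))
        targets.append(price[t + horizon])
    return np.array(rows), np.array(targets)


def select_by_correlation(X, y, k):
    """Return indices of the k columns with the largest |Pearson r| to y.

    A basic filter method, used only as a stand-in for the feature
    selection algorithms mentioned in the abstract.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Constant columns have zero variance; give them correlation 0.
    corr = np.divide(Xc.T @ yc, denom, out=np.zeros_like(denom), where=denom > 0)
    return np.argsort(-np.abs(corr))[:k]


if __name__ == "__main__":
    # Synthetic monthly data, purely for demonstration.
    rng = np.random.default_rng(0)
    n_months = 120
    exog = rng.normal(size=(n_months, 4))
    price = 2.0 + np.cumsum(rng.normal(scale=0.05, size=n_months))

    X, y = make_horizon_dataset(price, exog, n_lags=3, horizon=6)
    selected = select_by_correlation(X, y, k=3)
    print("dataset shape:", X.shape, "selected columns:", selected)
```

Any regressor, for example a radial basis function network, could then be trained once on all columns of `X` and once on `X[:, selected]`, which mirrors the whole-set versus selected-set comparison described above.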
