Piecewise evolutionary segmentation for feature extraction in time series models

The design, development and implementation of an innovative system utilized in feature extraction from time series data models is described in this manuscript. Achieving to design piecewise segmentation patterns on the time series in an evolutionary fashion and use them in order to produce fitter secondary data sets, the developed system adapts itself to the nature of the problem each time and finally elects an optimally parameterized classifier (artificial neural network or support vector machine), along with the fittest time series segmentation pattern. The application of the system onto two different problems involving time series data analysis and requiring predictive and classification capabilities (torrential risk assessment and plant virus identification, respectively), reveals that the proposed methodology was crucial in finding the optimum solution for both problems. Piecewise evolutionary segmentation time series model analysis, utilized by the accompanying software tool, succeeded in controlling the dimensionality and noise inherent in the initial raw time series information. The process eventually proposes a segmentation pattern for each problem, enhancing the potential of the corresponding classifier.

[1]  Friedrich Recknagel,et al.  Applications of machine learning to ecological modelling , 2001 .

[2]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[3]  C. Fasseas,et al.  Study on the mechanism of Bioelectric Recognition Assay: evidence for immobilized cell membrane interactions with viral fragments. , 2004, Biosensors & bioelectronics.

[4]  T. Ouarda,et al.  Generalized autoregressive conditional heteroscedasticity modelling of hydrologic time series , 2012 .

[5]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[6]  Shian-Chang Huang,et al.  Integrating GA-based time-scale feature extractions with SVMs for stock index forecasting , 2008, Expert Syst. Appl..

[7]  Mikko Kolehmainen,et al.  Evolving the neural network model for forecasting air pollution time series , 2004, Eng. Appl. Artif. Intell..

[8]  Stefanos Zafeiriou,et al.  Use of random time-intervals (RTIs) generation for biometric verification , 2009, Pattern Recognit..

[9]  L. Bodri,et al.  Prediction of extreme precipitation using a neural network: application to summer flood occurence in Moravia , 2000 .

[10]  S. Kintzios,et al.  Development of a novel, multi-analyte biosensor system for assaying cell division: identification of cell proliferation/death precursor events. , 2006, Biosensors & bioelectronics.

[11]  Neil Davey,et al.  Time Series Prediction and Neural Networks , 2001, J. Intell. Robotic Syst..

[12]  Coskun Hamzaçebi,et al.  Improving artificial neural networks' performance in seasonal time series forecasting , 2008, Inf. Sci..

[13]  Jianguo Luo,et al.  Nonlinear noise reduction of chaotic time series based on multidimensional recurrent LS-SVM , 2008, Neurocomputing.

[14]  Yan-ping Wang,et al.  A forecasting and forewarning model for methane hazard in working face of coal mine based on LS-SVM , 2008 .

[15]  Wang Xiao-Ye,et al.  A structure-adaptive piece-wise linear segments representation for time series , 2004, Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004..

[16]  Bo Zhong,et al.  BP neural network with rough set for short term load forecasting , 2009, Expert Syst. Appl..

[17]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[18]  Abraham Kandel,et al.  Data Mining in Time Series Database , 2004 .

[19]  Jui-Chung Hung,et al.  A genetic algorithm approach to the spectral estimation of time series with noise and missed observations , 2008, Inf. Sci..

[20]  C. P. Yialouris,et al.  Plant virus identification based on neural networks with evolutionary preprocessing , 2010 .

[21]  J. R. Ni,et al.  Application of artificial neural network to the rapid feedback of potential ecological risk in flood diversion zone , 2003 .

[22]  James B. McDonald,et al.  Time Series Prediction With Genetic‐Algorithm Designed Neural Networks: An Empirical Comparison With Modern Statistical Models , 1999, Comput. Intell..

[23]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[24]  J. Boaventura Cunha,et al.  Greenhouse air temperature predictive control using the particle swarm optimisation algorithm , 2005 .

[25]  Asok Ray,et al.  Pattern identification in dynamical systems via symbolic time series analysis , 2007, Pattern Recognit..

[26]  M. C. Deo,et al.  Real-time wave forecasting using genetic programming , 2008 .

[27]  Wei-Zhen Lu,et al.  Potential assessment of the "support vector machine" method in forecasting ambient air pollutant trends. , 2005, Chemosphere.

[28]  Éric D. Taillard,et al.  Analysis and test of efficient methods for building recursive deterministic perceptron neural networks , 2007, Neural Networks.

[29]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[30]  Friedrich Recknagel,et al.  Discovery of predictive rule sets for chlorophyll-a dynamics in the Nakdong River (Korea) by means of the hybrid evolutionary algorithm HEA , 2006, Ecol. Informatics.

[31]  Sarah Lesher,et al.  Symbolic time-series analysis of neural data , 2000, Neurocomputing.

[32]  David C. Hoaglin,et al.  Applications, basics, and computing of exploratory data analysis , 1983 .

[33]  Lazaros S. Iliadis,et al.  Feature extraction for time-series data: An artificial neural network evolutionary training model for the management of mountainous watersheds , 2009, Neurocomputing.

[34]  C. L. Changa,et al.  Applying fuzzy theory and genetic algorithm to interpolate precipitation , 2005 .

[35]  Paulin Coulibaly,et al.  Comparison of neural network methods for infilling missing daily weather records , 2007 .

[36]  Wing Lon Ng,et al.  Analysis of ultra-high-frequency financial data using advanced Fourier transforms , 2009 .

[37]  Kamalesh Kumar Sharma,et al.  Time delay estimation using fractional Fourier transform , 2007, Signal Process..

[38]  Teresa Bernarda Ludermir,et al.  Meta-learning approaches to selecting time series models , 2004, Neurocomputing.

[39]  Xiaohu Yang,et al.  A novel piecewise linear segmentation for time series , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[40]  Kwok-wing Chau A split-step particle swarm optimization algorithm in river stage forecasting , 2007 .

[41]  Jesús García,et al.  Piecewise Linear Representation Segmentation as a Multiobjective Optimization Problem , 2010, DCAI.

[42]  Haiping Du,et al.  Time series prediction using evolving radial basis function networks with new encoding scheme , 2008, Neurocomputing.

[43]  Turan G. Bali THE INTERTEMPORAL RELATION BETWEEN EXPECTED RETURNS AND RISK , 2008 .

[44]  Tzung-Pei Hong,et al.  Cluster-based genetic segmentation of time series with DWT , 2009, Pattern Recognit. Lett..

[45]  Michel Verleysen,et al.  Representation of functional data in neural networks , 2005, Neurocomputing.

[46]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[47]  Sreeram Ramakrishnan,et al.  A hybrid approach for feature subset selection using neural networks and ant colony optimization , 2007, Expert Syst. Appl..

[48]  Keiko Yamaguchi,et al.  Reexamination of stock price reaction to environmental performance: A GARCH application , 2008 .

[49]  J. Chow,et al.  A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile , 2008 .