A hybrid dynamic exploitation barebones particle swarm optimisation algorithm for time series segmentation

Abstract Large time series are difficult to be mined and preprocessed, hence reducing their number of points with minimum information loss is an active field of study. This paper proposes new methods based on time series segmentation, including the adaptation of the particle swarm optimisation algorithm (PSO) to this problem, and more advanced PSO versions, such as barebones PSO (BBPSO) and its exploitation version (BBePSO). Moreover, a novel algorithm is derived, referred to as dynamic exploitation barebones PSO (DBBePSO), which updates the importance of the social and cognitive components throughout the generations. All these algorithms are further improved by considering a final local search step based on the combination of two well-known standard segmentation algorithms (Bottom-Up and Top-Down). The performance of the different methods is evaluated using 15 time series from various application fields, and the results show that the novel algorithm (DBBePSO) and its hybrid version (HDBBePSO) outperform the rest of segmentation techniques.

[1]  Pedro Antonio Gutiérrez,et al.  Identifying Market Behaviours Using European Stock Index Time Series by a Hybrid Segmentation Algorithm , 2017, Neural Processing Letters.

[2]  János Abonyi,et al.  Modified Gath-Geva clustering for fuzzy segmentation of multivariate time-series , 2005, Fuzzy Sets Syst..

[3]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[4]  Andries Petrus Engelbrecht,et al.  Barebones Particle Swarm for Integer Programming Problems , 2007, 2007 IEEE Swarm Intelligence Symposium.

[5]  Y. Bodyanskiy,et al.  Robust Recursive Fuzzy Clustering-Based Segmentation of Biological Time Series , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[6]  Marc Salotti An efficient algorithm for the optimal polygonal approximation of digitized curves , 2001, Pattern Recognit. Lett..

[7]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[8]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[9]  Hüseyin Kaya,et al.  A distance based time series classification framework , 2015, Inf. Syst..

[10]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[11]  James R. Wilson,et al.  Empirical Investigation of the Benefits of Partial Lamarckianism , 1997, Evolutionary Computation.

[12]  Thomas Guyet,et al.  Long term analysis of time series of satellite images , 2016, Pattern Recognit. Lett..

[13]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[14]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[15]  G.B. Moody,et al.  The impact of the MIT-BIH Arrhythmia Database , 2001, IEEE Engineering in Medicine and Biology Magazine.

[16]  Liang Zhao,et al.  Time series clustering via community detection in networks , 2015, Inf. Sci..

[17]  Pedro Antonio Gutiérrez,et al.  Detection of early warning signals in paleoclimate data using a genetic time series segmentation algorithm , 2015, Climate Dynamics.

[18]  Jonathan J. Oliver,et al.  Bayesian Approaches to Segmenting A Simple Time Series , 1997 .

[19]  Emile H. L. Aarts,et al.  Genetic Local Search Algorithms for the Travelling Salesman Problem , 1990, PPSN.

[20]  V. K. Koumousis,et al.  A saw-tooth genetic algorithm combining the effects of variable population size and reinitialization to enhance performance , 2006, IEEE Transactions on Evolutionary Computation.

[21]  Pedro Antonio Gutiérrez,et al.  A statistically-driven Coral Reef Optimization algorithm for optimal size reduction of time series , 2018, Appl. Soft Comput..

[22]  Mu-Yen Chen,et al.  A hybrid fuzzy time series model based on granular computing for stock price forecasting , 2015, Inf. Sci..

[23]  Carlos Agón,et al.  Time-series data mining , 2012, CSUR.

[24]  James Kennedy,et al.  Bare bones particle swarms , 2003, Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS'03 (Cat. No.03EX706).

[25]  Pedro Antonio Gutiérrez,et al.  Detection and prediction of segments containing extreme significant wave heights , 2017 .

[26]  Ming Xin,et al.  Adaptive multi-cue based particle swarm optimization guided particle filter tracking in infrared videos , 2013, Neurocomputing.

[27]  Maurice Clerc,et al.  The particle swarm - explosion, stability, and convergence in a multidimensional complex space , 2002, IEEE Trans. Evol. Comput..

[28]  I. Johnstone,et al.  Wavelet Shrinkage: Asymptopia? , 1995 .

[29]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[30]  Tak-Chung Fu,et al.  An evolutionary approach to pattern-based time series segmentation , 2004, IEEE Transactions on Evolutionary Computation.

[31]  María Pérez-Ortiz,et al.  On the use of evolutionary time series analysis for segmenting paleoclimate data , 2017, Neurocomputing.

[32]  Erwin Pesch,et al.  Genetic Local Search in Combinatorial Optimization , 1994, Discret. Appl. Math..

[33]  Hamid Reza Karimi,et al.  Prediction of stock index futures prices based on fuzzy sets and multivariate fuzzy time series , 2015, Neurocomputing.

[34]  Jeffrey A. Joines,et al.  Utilizing Hybrid Genetic Algorithms , 2003 .

[35]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[36]  Guoyin Wang,et al.  A novel water quality data analysis framework based on time-series data mining. , 2017, Journal of environmental management.

[37]  Kwok-wing Chau,et al.  Particle Swarm Optimization Training Algorithm for ANNs in Stage Prediction of Shing Mun River , 2006 .

[38]  Adisa A. Jimoh,et al.  Particle swarm optimization for power system state estimation , 2015, Neurocomputing.

[39]  Pedro Antonio Gutiérrez,et al.  Time Series Representation by a Novel Hybrid Segmentation Algorithm , 2016, HAIS.

[40]  Hector Budman,et al.  A hybrid clustering approach for multivariate time series - A case study applied to failure analysis in a gas turbine. , 2017, ISA transactions.

[41]  Laurent Itti,et al.  Classifying Time Series Using Local Descriptors with Hybrid Sampling , 2016, IEEE Transactions on Knowledge and Data Engineering.

[42]  C. S. Wallace,et al.  Minimum Message Length Segmentation , 1998, PAKDD.

[43]  Simon Fong,et al.  Financial time series pattern matching with extended UCR Suite and Support Vector Machine , 2016, Expert Syst. Appl..

[44]  Tzung-Pei Hong,et al.  Cluster-based genetic segmentation of time series with DWT , 2009, Pattern Recognit. Lett..

[45]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[46]  Jason Lines,et al.  Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles , 2015, IEEE Trans. Knowl. Data Eng..

[47]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[48]  Bernhard Sick,et al.  On-line motif detection in time series with SwiftMotif , 2009, Pattern Recognit..

[49]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[50]  Ángel Carmona Poyato,et al.  New method for obtaining optimal polygonal approximations to solve the min-$$\varepsilon$$ε problem , 2016, Neural Computing and Applications.

[51]  Andries Petrus Engelbrecht,et al.  Particle swarm optimization: Velocity initialization , 2012, 2012 IEEE Congress on Evolutionary Computation.