Segmentation of Time Series Data

Time series data is usually generated by measuring and monitoring applications, and accounts for a large fraction of the data available for analysis purposes. A time series is typically a sequence of values that represent the state of a variable over time. Each value of the variable might be a simple value, or might have a composite structure, such as a vector of values. Time series data can be collected about natural phenomena, such as the amount of rainfall in a geographical region, or about a human activity, such as the number of shares of GoogleTM stock sold each day. Time series data is typically used for predicting future behavior from historical performance. However, a time series often needs further processing to discover the structure and properties of the recorded variable, thereby facilitating the understanding of past behavior and prediction of future behavior. Segmentation of a given time series is often used to compactly represent the time series (Gionis & Mannila, 2005), to reduce noise, and to serve as a high-level representation of the data (Das, Lin, Mannila, Renganathan & Smyth, 1998; Keogh & Kasetty, 2003). Data mining of a segmentation of a time series, rather than the original time series itself, has been used to facilitate discovering structure in the data, and finding various kinds of information, such as abrupt changes in the model underlying the time series (Duncan & Bryant, 1996; Keogh & Kasetty, 2003), event detection (Guralnik & Srivastava, 1999), etc. The rest of this chapter is organized as follows. The section on Background gives an overview of the time series segmentation problem and solutions. This section is followed by a Main Focus section where details of the tasks involved in segmenting a given time series and a few sample applications are discussed. Then, the Future Trends section presents some of the current research trends in time series segmentation and the Conclusion section concludes the chapter. Several important terms and their definitions are also included at the end of the chapter.

[1]  Daniel J. Rosenkrantz,et al.  On lossy time decompositions of time stamped documents , 2004, CIKM '04.

[2]  Ingrid Fischer Neural Networks and Graph Transformations , 2009, Encyclopedia of Data Warehousing and Mining.

[3]  Theodosios Pavlidis,et al.  Segmentation of Plane Curves , 1974, IEEE Transactions on Computers.

[4]  Paul R. Cohen,et al.  Unsupervised segmentation of categorical time series into episodes , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[5]  Daniel Lemire,et al.  A Better Alternative to Piecewise Linear Time Series Segmentation , 2006, SDM.

[6]  Tak-Chung Fu,et al.  An evolutionary approach to pattern-based time series segmentation , 2004, IEEE Transactions on Evolutionary Computation.

[7]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[8]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[9]  Heikki Mannila,et al.  Time series segmentation for context recognition in mobile devices , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[10]  Daniel J. Rosenkrantz,et al.  Discovering Dynamic Developer Relationships from Software Version Histories by Time Series Segmentation , 2007, 2007 IEEE International Conference on Software Maintenance.

[11]  Philip Calvert,et al.  Encyclopedia of Data Warehousing and Mining , 2006 .

[12]  János Abonyi,et al.  Modified Gath-Geva clustering for fuzzy segmentation of multivariate time-series , 2005, Fuzzy Sets Syst..

[13]  John Wang,et al.  Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications , 2008 .

[14]  Tzung-Pei Hong,et al.  Segmentation of Time Series by the Clustering and Genetic Algorithms , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[15]  Kate Smith-Miles,et al.  Kernal Width Selection for SVM Classification: A Meta-Learning Approach , 2005, Int. J. Data Warehous. Min..