Data Mining in Time Series Databases

A Survey of Recent Methods for Efficient Retrieval of Similar Time Sequences (M L Hetland)
Indexing of Compressed Time Series (E Fink & K Pratt)
Boosting Interval-Based Literals: Variable Length and Early Classification (J J Rodriguez Diez)
Segmenting Time Series: A Survey and Novel Approach (E Keogh et al)
Indexing Similar Time Series under Conditions of Noise (M Vlachos et al)
Classification of Events in Time Series of Graphs (H Bunke & M Kraetzl)
Median Strings - A Review (X Jiang et al)
Change Detection in Classification Models of Data Mining (G Zeira et al)
