Finding relevant sequences in time series containing crisp, interval, and fuzzy interval data

Finding similar sequences in time series has received much attention and is a widely studied topic. Most existing approaches in the time series area focus on the efficiency of algorithms but seldom provide a means to handle imprecise data. In this paper, a more general approach is proposed to measure the distance of time sequences containing crisp values, intervals, and fuzzy intervals as well. The concept of distance measurement and its associated dynamic-programming-based algorithms are described. In addition to finding the sequences with similar evolving trends, a means of finding the sequences with opposite evolving tendencies is also proposed, which is usually omitted in current related research but could be of great interest to many users.

[1]  Weiyi Liu,et al.  The fuzzy association degree in semantic data models , 2001, Fuzzy Sets Syst..

[2]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[3]  Wei Yi Liu,et al.  The fuzzy functional dependency on the basis of the semantic distance , 1993 .

[4]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[5]  Thomas Janoski,et al.  Introduction to Time-Series Analysis , 1994 .

[6]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[7]  Siddhartha Bhattacharyya,et al.  Data mining on time series: an illustration using fast-food restaurant franchise data , 2001 .

[8]  I. Turksen,et al.  An approximate analogical reasoning schema based on similarity measures and interval-valued fuzzy sets , 1990 .

[9]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[10]  Vassilios Petridis,et al.  FINkNN: A Fuzzy Interval Number k-Nearest Neighbor Classifier for Prediction of Sugar Production from Populations of Samples , 2003, J. Mach. Learn. Res..

[11]  Fei Wu,et al.  Knowledge discovery in time-series databases , 2001 .

[12]  B. Baets,et al.  A comparative study of similarity measures , 1995 .

[13]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Dimitrios Gunopulos,et al.  Time-series similarity problems and well-separated geometric sets , 1997, SCG '97.

[15]  Ambuj K. Singh,et al.  Similarity searching for multi-attribute sequences , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[16]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[17]  Shyi-Ming Chen,et al.  Forecasting enrollments based on fuzzy time series , 1996, Fuzzy Sets Syst..

[18]  Stephen Shaoyi Liao,et al.  Functional dependencies with null values, fuzzy values, and crisp values , 1999, IEEE Trans. Fuzzy Syst..

[19]  A. Mendelzon,et al.  Efficient retrieval of similar time series , 2000 .

[20]  I. Burhan Türksen,et al.  An approximate analogical reasoning approach based on similarity measures , 1988, IEEE Trans. Syst. Man Cybern..

[21]  Edward R. Vrscay,et al.  Iterated fuzzy set systems: A new approach to the inverse problem for fractals and other sets , 1992 .

[22]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[23]  Karthikeyan Ramasamy,et al.  Set Valued Attributes , 2005, Encyclopedia of Database Technologies and Applications.

[24]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[25]  Man Hon Wong,et al.  Fast time-series searching with scaling and shifting , 1999, PODS '99.

[26]  Heikki Mannila,et al.  Similarity between Event Types in Sequences , 1999, DaWaK.

[27]  Juan Pedro Caraça-Valente,et al.  Discovering similar patterns in time series , 2000, KDD '00.

[28]  David J. Hand,et al.  Data Mining: Statistics and More? , 1998 .

[29]  Christos Faloutsos,et al.  Efficiently supporting ad hoc queries in large datasets of time sequences , 1997, SIGMOD '97.

[30]  Simon Parsons,et al.  Addendum to "Current Approaches to Handling Imperfect Information in Data and Knowledge Bases" , 1996, IEEE Trans. Knowl. Data Eng..

[31]  Tae-Hoon Kim,et al.  Shape-based retrieval of similar subsequences in time-series databases , 2002, SAC '02.

[32]  Pirjo Ronkainen,et al.  Attribute Similarity and Event Sequence Similarity in Data Mining , 1998 .

[33]  Didier Dubois,et al.  Possibility Theory - An Approach to Computerized Processing of Uncertainty , 1988 .

[34]  Robert Lowen,et al.  Distances between fuzzy sets representing grey level images , 1998, Fuzzy Sets Syst..

[35]  Stanislaw Heilpern,et al.  Representation and application of fuzzy numbers , 1997, Fuzzy Sets Syst..

[36]  Dina Q. Goldin,et al.  On Similarity Queries for Time-Series Data: Constraint Specification and Implementation , 1995, CP.

[37]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[38]  Tak-chung Fu,et al.  Flexible time series pattern matching based on perceptually important points , 2001 .

[39]  Qiang Song,et al.  A new fuzzy time-series model of fuzzy number observations , 1995 .