Mining fuzzy rules for time series classification

Time series classification is concerned about discovering classification models in a database of pre-classified time series and using them to classify unseen time series. To better handle the noises and fuzziness in time series data, we propose a new data mining technique to mine fuzzy rules in the data. The fuzzy rules discovered employ fuzzy sets to represent the revealed regularities and exceptions. The resilience of fuzzy sets to noises allows the proposed approach to better handle the noises embedded in the data. Furthermore, it uses the adjusted residual as an objective measure to evaluate the interestingness of association relationships hidden in the data. The adjusted residual analysis allows the differentiation of interesting relationships from uninteresting ones without any user-specified thresholds. To evaluate the performance of the proposed approach, we applied it to several well-known time series datasets. The experimental results showed that our approach is very promising.

[1]  Piotr Indyk,et al.  Identifying Representative Trends in Massive Time Series Data Sets Using Sketches , 2000, VLDB.

[2]  Eugene Fink,et al.  Search for Patterns in Compressed Time Series , 2002, Int. J. Image Graph..

[3]  Changzhou Wang,et al.  Supporting content-based searches on time series via approximation , 2000, Proceedings. 12th International Conference on Scientific and Statistica Database Management.

[4]  Henrik André-Jönsson,et al.  Using Signature Files for Querying Time-Series Data , 1997, PKDD.

[5]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[6]  Heikki Mannila,et al.  Discovering Frequent Episodes in Sequences , 1995, KDD.

[7]  Keith C. C. Chan,et al.  Mining fuzzy association rules , 1997, CIKM '97.

[8]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[9]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[10]  Keith C. C. Chan,et al.  APACS: a system for the automatic analysis and classification of conceptual patterns , 1990, Comput. Intell..

[11]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[12]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[13]  Keith C. C. Chan,et al.  Mining fuzzy association rules in a database containing relational and transactional data , 2001 .

[14]  Zbigniew R. Struzik,et al.  The Haar Wavelet Transform in the Time Series Similarity Paradigm , 1999, PKDD.

[15]  Nasser Yazdani,et al.  Matching and indexing sequences of different lengths , 1997, CIKM '97.

[16]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[17]  Keith C. C. Chan,et al.  Mining fuzzy association rules in a bank-account database , 2003, IEEE Trans. Fuzzy Syst..

[18]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[19]  Philip S. Yu,et al.  Adaptive query processing for time-series data , 1999, KDD '99.

[20]  Fei Wu,et al.  Knowledge discovery in time-series databases , 2001 .

[21]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[22]  Juan José Rodríguez Diez,et al.  Time Series Classification by Boosting Interval Based Literals , 2000, Inteligencia Artif..

[23]  Christopher M. Bishop,et al.  Classification and regression , 1997 .

[24]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[25]  Andrew K. C. Wong,et al.  Statistical Technique for Extracting Classificatory Knowledge from Databases , 1991, Knowledge Discovery in Databases.

[26]  Hongjun Lu,et al.  Stock movement prediction and N-dimensional inter-transaction association rules , 1998, SIGMOD 1998.

[27]  Hannu Toivonen,et al.  Mining for similarities in aligned time series using wavelets , 1999, Defense, Security, and Sensing.

[28]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[29]  Padhraic Smyth,et al.  An Information Theoretic Approach to Rule Induction from Databases , 1992, IEEE Trans. Knowl. Data Eng..

[30]  R. J. Alcock,et al.  Time-Series Similarity Queries Employing a Feature-Based Approach , 1999 .

[31]  Piotr Indyk,et al.  Mining the stock market (extended abstract): which measure is best? , 2000, KDD '00.

[32]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[33]  Keith C. C. Chan,et al.  Classification with degree of membership: a fuzzy approach , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[34]  Eamonn J. Keogh,et al.  A Probabilistic Approach to Fast Pattern Matching in Time Series Databases , 1997, KDD.

[35]  Dragomir Anguelov,et al.  Mining The Stock Market : Which Measure Is Best ? , 2000 .

[36]  Konstantinos Kalpakis,et al.  Distance measures for effective clustering of ARIMA time-series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[37]  W. Peizhuang Pattern Recognition with Fuzzy Objective Function Algorithms (James C. Bezdek) , 1983 .