Perception-based approach to time series data mining

Time series data mining (TSDM) techniques permit exploring large amounts of time series data in search of consistent patterns and/or interesting relationships between variables. TSDM is becoming increasingly important as a knowledge management tool where it is expected to reveal knowledge structures that can guide decision making in conditions of limited certainty. Human decision making in problems related with analysis of time series databases is usually based on perceptions like ''end of the day'', ''high temperature'', ''quickly increasing'', ''possible'', etc. Though many effective algorithms of TSDM have been developed, the integration of TSDM algorithms with human decision making procedures is still an open problem. In this paper, we consider architecture of perception-based decision making system in time series databases domains integrating perception-based TSDM, computing with words and perceptions, and expert knowledge. The new tasks which should be solved by the perception-based TSDM methods to enable their integration in such systems are discussed. These tasks include: precisiation of perceptions, shape pattern identification, and pattern retranslation. We show how different methods developed so far in TSDM for manipulation of perception-based information can be used for development of a fuzzy perception-based TSDM approach. This approach is grounded in computing with words and perceptions permitting to formalize human perception-based inference mechanisms. The discussion is illustrated by examples from economics, finance, meteorology, medicine, etc.

[1]  L. Sheremetov,et al.  Association networks in time series data mining , 2005, NAFIPS 2005 - 2005 Annual Meeting of the North American Fuzzy Information Processing Society.

[2]  Thomas Sudkamp,et al.  Examples, counterexamples, and measuring fuzzy associations , 2005, Fuzzy Sets Syst..

[3]  F. Höppner Learning Temporal Rules from State Sequences , 2001 .

[4]  Wolfgang Spohn,et al.  The Representation of , 1986 .

[5]  Olga Pons,et al.  Knowledge Management in Fuzzy Databases , 2000 .

[6]  Gloria Bordogna,et al.  Recent Issues on Fuzzy Databases , 2000 .

[7]  Leonard Ray Teel The Weather Channel , 1982 .

[8]  Lotfi A. Zadeh,et al.  From Computing with Numbers to Computing with Words - from Manipulation of Measurements to Manipulation of Perceptions , 2005, Logic, Thought and Action.

[9]  Ildar Batyrshin,et al.  Towards a linguistic description of dependencies in data , 2002 .

[10]  Charu C. Aggarwal,et al.  Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery, DMKD 2003, San Diego, California, USA, June 13, 2003 , 2003, DMKD.

[11]  Ildar Z. Batyrshin,et al.  Moving Approximation Transform and Local Trend Associations inTime Series Data Bases , 2007, Perception-based Data Mining and Decision Making in Economics and Finance.

[12]  Jim Hunter,et al.  Choosing words in computer-generated weather forecasts , 2005, Artif. Intell..

[13]  Etienne Kerre,et al.  Fuzzy Data Mining: Discovery of Fuzzy Generalized Association Rules+ , 2000 .

[14]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[15]  Ildar Batyrshin,et al.  Towards Perception Based Time Series Data Mining , 2007 .

[16]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[17]  Ronald R. Yager,et al.  On Linguistic Summaries of Data , 1991, Knowledge Discovery in Databases.

[18]  Lotfi A. Zadeh,et al.  Precisiated Natural Language , 2007, Aspects of Automatic Text Analysis.

[19]  Laveen N. Kanal,et al.  Structural pattern recognition of Carotid pulse waves using a general waveform parsing system , 1976, CACM.

[20]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[21]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[22]  Toshiomi Yoshida,et al.  Real-time qualitative analysis of the temporal shapes of (bio) process variables , 1992 .

[23]  C. D. Olds On the representations, $N_3 \left( {n^2 } \right)$ , 1941 .

[24]  Didier Dubois,et al.  On the representation, measurement, and discovery of fuzzy associations , 2005, IEEE Transactions on Fuzzy Systems.

[25]  Abraham Kandel,et al.  Data Mining and Computational Intelligence , 2001 .

[26]  Johannes Ledolter,et al.  Time series and forecasting : an applied approach , 1981 .

[27]  Karen Kukich,et al.  Design of a Knowledge-Based Report Generator , 1983, ACL.

[28]  Eyke Hüllermeier,et al.  A Note on Quality Measures for Fuzzy Asscociation Rules , 2003, IFSA.

[29]  Nikola Kasabov Fril—fuzzy and evidential reasoning in artificial intelligence , 1996 .

[30]  Janusz Kacprzyk,et al.  Data Mining via Linguistic Summaries of Databases: An Interactive Approach , 2001 .

[31]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[32]  Ildar Z. Batyrshin,et al.  Perception Based Time Series Data Mining with MAP Transform , 2005, MICAI.

[33]  Ildar Z. Batyrshin,et al.  Construction of Granular Derivatives and Solution of Granular Initial Value Problem , 2004 .

[34]  Lotfi A. Zadeh,et al.  The Concepts of a Linguistic Variable and its Application to Approximate Reasoning , 1975 .

[35]  Giuseppe Psaila,et al.  Querying Shapes of Histories , 1995, VLDB.

[36]  Chris Mellish,et al.  Choosing the content of textual summaries of large time-series data sets , 2006, Natural Language Engineering.

[37]  Paul R. Cohen,et al.  An Algorithm for Segmenting Categorical Time Series into Meaningful Episodes , 2001, IDA.

[38]  Lotfi A. Zadeh,et al.  Precisiated Natural Language (PNL) , 2004, AI Mag..

[39]  Ute St. Clair,et al.  Fuzzy Set Theory: Foundations and Applications , 1997 .

[40]  James C. Bezdek,et al.  Fuzzy Models and Digital Signal Processing(for Pattern Recognition): Is This a Good Marriage? , 1993 .

[41]  David B. Lomet,et al.  Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms , 1993 .

[42]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[43]  Lotfi A. Zadeh,et al.  The concept of a linguistic variable and its application to approximate reasoning-III , 1975, Inf. Sci..

[44]  G. Stephanopoulos,et al.  Representation of process trends—Part I. A formal representation framework , 1990 .

[45]  Masoud Nikravesh,et al.  Fuzzy Partial Differential Equations and Relational Equations , 2004 .

[46]  Jim Hunter,et al.  Segmenting Time Series for Weather Forecasting , 2003 .

[47]  Fei Wu,et al.  Knowledge discovery in time-series databases , 2001 .

[48]  Ronald R. Yager,et al.  On the retranslation process in Zadeh's paradigm of computing with words , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[49]  Sarah E. Boyd TREND: A System for Generating Intelligent Descriptions of Time-Series Data , 1998 .

[50]  L. Zadeh From Computing with Numbers to Computing with Words , 2001 .

[51]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[52]  Lotfi A. Zadeh,et al.  Shadows of fuzzy sets , 1996 .

[53]  Trevor P Martin,et al.  Time series modelling and prediction using fuzzy trend information , 1998 .

[54]  Ildar Batyrshin,et al.  Perception-based Data Mining and Decision Making in Economics and Finance , 2007, Studies in Computational Intelligence.

[55]  Lotfi A. Zadeh,et al.  Outline of a New Approach to the Analysis of Complex Systems and Decision Processes , 1973, IEEE Trans. Syst. Man Cybern..

[56]  Qiang Wang,et al.  A symbolic representation of time series , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..

[57]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[58]  Masoud Nikravesh,et al.  Fuzzy Partial Differential Equations and Relational Equations: Reservoir Characterization And Modeling , 2004 .

[59]  Sankar K. Pal,et al.  Data mining in soft computing framework: a survey , 2002, IEEE Trans. Neural Networks.

[60]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.