Learning recurrent behaviors from heterogeneous multivariate time-series

OBJECTIVE For the last years, time-series mining has become a challenging issue for researchers. An important application lies in most monitoring purposes, which require analyzing large sets of time-series for learning usual patterns. Any deviation from this learned profile is then considered as an unexpected situation. Moreover, complex applications may involve the temporal study of several heterogeneous parameters. In that paper, we propose a method for mining heterogeneous multivariate time-series for learning meaningful patterns. METHODS The proposed approach allows for mixed time-series - containing both pattern and non-pattern data - such as for imprecise matches, outliers, stretching and global translating of patterns instances in time. RESULTS We present the results of our approach on synthetic data generated in the context of monitoring a person at home, as well as early results on few real sequences. The purpose is to build a behavioral profile of a person in their daily activities by analyzing the time variations of several quantitative or qualitative parameters recorded through a provision of sensors. CONCLUSIONS The results are very promising. They also highlight the difficulty of tuning the parameters of the method.

[1]  Catherine Garbay,et al.  Mining Heterogeneous Multivariate Time-Series for Learning Meaningful Patterns: Application to Home Health Telecare , 2004, ArXiv.

[2]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[3]  Marie Chavent,et al.  A monothetic clustering method , 1998, Pattern Recognit. Lett..

[4]  Catherine Garbay,et al.  Similarity measure for heterogeneous multivariate time-series , 2004, 2004 12th European Signal Processing Conference.

[5]  H. Mannila,et al.  Clustered segmentations , 2004 .

[6]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[7]  Mohammed J. Zaki,et al.  Scalable Feature Mining for Sequential Data , 2000, IEEE Intell. Syst..

[8]  Daniel Kudenko,et al.  Feature Generation for Sequence Categorization , 1998, AAAI/IAAI.

[9]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[10]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[11]  Jacques Demongeot,et al.  Health "Smart" home: information technology for patients at home. , 2002, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[12]  Yannis Manolopoulos,et al.  Feature-based classification of time-series data , 2001 .

[13]  Sylvian R. Ray,et al.  A new scheme for extracting multi-temporal sequence patterns , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[14]  Eamonn J. Keogh,et al.  An Enhanced Representation of Time Series Which Allows Fast and Accurate Classification, Clustering and Relevance Feedback , 1998, KDD.

[15]  Deok-Hwan Kim,et al.  Similarity search for multidimensional data sequences , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[16]  Frank Höppner Discovery of Temporal Patterns. Learning Rules about the Qualitative Behaviour of Time Series , 2001, PKDD.

[17]  János Abonyi,et al.  Modified Gath-Geva clustering for fuzzy segmentation of multivariate time-series , 2005, Fuzzy Sets Syst..

[18]  J M Mallion,et al.  Toward a portable blood pressure recorder device equipped with an accelerometer. , 1999, Medical engineering & physics.

[19]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[20]  Catherine Garbay,et al.  A hybrid Knowledge-Based Methodology for Multivariate Simulation in Home Health Telecare , 2003 .

[21]  A Franco,et al.  A smart room for hospitalised elderly people: essay of modelling and first steps of an experiment. , 1999, Technology and health care : official journal of the European Society for Engineering and Medicine.

[22]  V. Rialle,et al.  An hybrid refinement methodology for multivariate simulation in home health telecare , 2003, Proceedings 5th International Workshop on Enterprise Networking and Computing in Healthcare Industry (HealthCom).

[23]  Cláudia Antunes,et al.  Temporal Data Mining: an overview , 2001 .

[24]  Jeremy Buhler,et al.  Finding motifs using random projections , 2001, RECOMB.

[25]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[26]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[27]  Thomas S. Huang,et al.  Learning to extract temporal signal patterns from temporal signal sequence , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.