Learning Comprehensible Descriptions of Multivariate Time Series

Supervised classiication is one of the most active areas of machine learning research. Most work has focused on classiication in static domains, where an instantaneous snapshot of attributes is meaningful. In many domains, attributes are not static; in fact, it is the way they vary temporally that can make classiication possible. Examples of such domains include speech recognition, gesture recognition and electrocardiograph clas-siication. While it is possible to use ad hoc, domain-speciic techniques for \\atten-ing" the time series to a learner-friendly representation , this fails to take into account both the special problems and special heuris-tics applicable to temporal data and often results in unreadable concept descriptions. Though traditional time series techniques can sometimes produce accurate classiiers, few can provide comprehensible descriptions. We propose a general architecture for classiica-tion and description of multivariate time series. It employs event primitives to analyse the training data and extract events. These events are clustered, creating proto-typical events which are used as the basis for creating more accurate and comprehensi-ble classiiers. A minimal implementation of this architecture, called TClass, is applied to two domains, one real and one artiicial and compared against a na ve approach. TClass shows great promise, particularly in compre-hensibility, but also in accuracy.

[1]  T. Johnston,et al.  A.U.S.L.A.N dictionary : a dictionary of the sign language of the Australian deaf community , 1989 .

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  R. Coifman,et al.  Local feature extraction and its applications using a library of bases , 1994 .

[4]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[5]  Heikki Mannila,et al.  Discovering Frequent Episodes in Sequences , 1995, KDD.

[6]  W. Kadous GRASP: Recognition of Australian Sign Language Using Instrumented Gloves , 1995 .

[7]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[8]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[9]  Yuval Shahar,et al.  Knowledge-based temporal abstraction in clinical domains , 1996, Artif. Intell. Medicine.

[10]  Yoshua Bengio,et al.  Neural networks for speech and sequence recognition , 1996 .

[11]  Yuval Shahar,et al.  A Framework for Knowledge-Based Temporal Abstraction , 1997, Artif. Intell..

[12]  Eamonn J. Keogh,et al.  An Enhanced Representation of Time Series Which Allows Fast and Accurate Classification, Clustering and Relevance Feedback , 1998, KDD.

[13]  Geoffrey Zweig,et al.  Speech Recognition with Dynamic Bayesian Networks , 1998, AAAI/IAAI.

[14]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[15]  Kevin P. Murphy,et al.  Learning the Structure of Dynamic Probabilistic Networks , 1998, UAI.

[16]  Paul R. Cohen,et al.  Discovering Rules for Clustering and Predicting Asynchronous Events , 1998 .

[17]  Paul R. Cohen,et al.  Concepts From Time Series , 1998, AAAI/IAAI.

[18]  A. Danyluk Predicting the Future : AI Approaches to Time-Series Problems , 2022 .