Joint segmentation and classification of time series using class-specific features

We present an approach for the joint segmentation and classification of a time series. The segmentation is on the basis of a menu of possible statistical models: each of these must be describable in terms of a sufficient statistic, but there is no need for these sufficient statistics to be the same, and these can be as complex (for example, cepstral features or autoregressive coefficients) as fits. All that is needed is the probability density function (PDF) of each sufficient statistic under its own assumed model-presumably this comes from training data, and it is particularly appealing that there is no need at all for a joint statistical characterization of all the statistics. There is similarly no need for an a-priori specification of the number of sections, as the approach uses an appropriate penalization of an over-zealous segmentation. The scheme has two stages. In stage one, rough segmentations are implemented sequentially using a piecewise generalized likelihood ratio (GLR); in the second stage, the results from the first stage (both forward and backward) are refined. The computational burden is remarkably small, approximately linear with the length of the time series, and the method is nicely accurate in terms both of discovered number of segments and of segmentation accuracy. A hybrid of the approach with one based on Gibbs sampling is also presented; this combination is somewhat slower but considerably more accurate.

[1]  A. Moghaddamjoo Automatic segmentation and classification of ionic-channel signals , 1991, IEEE Transactions on Biomedical Engineering.

[2]  Régine André-Obrecht,et al.  A new statistical approach for the automatic segmentation of continuous speech signals , 1988, IEEE Trans. Acoust. Speech Signal Process..

[3]  C. Elkan,et al.  Unsupervised learning of multiple motifs in biopolymers using expectation maximization , 1995, Machine Learning.

[4]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[5]  Marc Lavielle,et al.  Optimal segmentation of random processes , 1998, IEEE Trans. Signal Process..

[6]  V. Piuri,et al.  Speech recognition through phoneme segmentation and neural classification , 1997, IEEE Instrumentation and Measurement Technology Conference Sensing, Processing, Networking. IMTC Proceedings.

[7]  Michèle Basseville,et al.  Sequential detection of abrupt changes in spectral characteristics of digital signals , 1983, IEEE Trans. Inf. Theory.

[8]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[9]  Benjamin Friedlander,et al.  Performance analysis of transient detectors based on a class of linear data transforms , 1992, IEEE Trans. Inf. Theory.

[10]  Guojun Lu,et al.  An investigation of automatic audio classification and segmentation , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[11]  Stephen Del Marco,et al.  Improved transient signal detection using a wavepacket-based detector with an extended translation-invariant wavelet transform , 1997, IEEE Trans. Signal Process..

[12]  Michèle Basseville,et al.  Detection of abrupt changes: theory and application , 1993 .

[13]  R. J. Kenefic,et al.  An algorithm to partition DFT data into sections of constant variance , 1998 .

[14]  É. Moulines,et al.  Least‐squares Estimation of an Unknown Number of Shifts in a Time Series , 2000 .

[15]  H. V. Trees Detection, Estimation, And Modulation Theory , 2001 .

[16]  Heinrich Niemann,et al.  A theoretically optimal probabilistic classifier using class-specific features , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[17]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[18]  Zhen Wang,et al.  Two algorithms to segment white Gaussian data with piecewise constant variances , 2003, IEEE Trans. Signal Process..

[19]  Paul M. Baggenstoss Class-specific feature sets in classification , 1999, IEEE Trans. Signal Process..

[20]  C.-C. Jay Kuo,et al.  Audio content analysis for online audiovisual data segmentation and classification , 2001, IEEE Trans. Speech Audio Process..

[21]  S. Roucos,et al.  A stochastic segment model for phoneme-based continuous speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  Mari Ostendorf,et al.  Fast algorithms for phone classification and recognition using segment-based models , 1992, IEEE Trans. Signal Process..

[23]  Zhen Wang,et al.  Class-specific segmentation of time series , 2003, 2003 IEEE Aerospace Conference Proceedings (Cat. No.03TH8652).

[24]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[25]  H. Akaike A new look at the statistical model identification , 1974 .

[26]  Achim V. Brandt,et al.  Detecting and estimating parameter jumps using ladder algorithms and likelihood ratio tests , 1983, ICASSP.

[27]  Albert H. Nuttall,et al.  Multidimensional probability density function approximations for detection, classification, and model order selection , 2001, IEEE Trans. Signal Process..

[28]  Thomas L. Hemminger,et al.  Detection and classification of underwater acoustic transients using neural networks , 1994, IEEE Trans. Neural Networks.

[29]  Olivier Cappé A Bayesian approach for simultaneous segmentation and classification of count data , 2002, IEEE Trans. Signal Process..