Creating Discriminative Models for Time Series Classification and Clustering by HMM Ensembles

Classification of temporal data sequences is a fundamental branch of machine learning with a broad range of real world applications. Since the dimensionality of temporal data is significantly larger than static data, and its modeling and interpreting is more complicated, performing classification and clustering on temporal data is more complex as well. Hidden Markov models (HMMs) are well-known statistical models for modeling and analysis of sequence data. Besides, ensemble methods, which employ multiple models to obtain the target model, revealed good performances in the conducted experiments. All these facts are a high level of motivation to employ HMM ensembles in the task of classification and clustering of time series data. So far, no effective classification and clustering method based on HMM ensembles has been proposed. Moreover, employing the limited existing HMM ensemble methods has trouble separating models of distinct classes as a vital task. In this paper, according to previous points a new framework based on HMM ensembles for classification and clustering is proposed. In addition to its strong theoretical background by employing the Rényi entropy for ensemble learning procedure, the main contribution of the proposed method is addressing HMM-based methods problem in separating models of distinct classes by considering the inverse emission matrix of the opposite class to build an opposite model. The proposed algorithms perform more effectively compared to other methods especially other HMM ensemble-based methods. Moreover, the proposed clustering framework, which derives benefits from both similarity-based and model-based methods, together with the Rényi-based ensemble method revealed its superiority in several measurements.

[1]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[2]  Robert M. Gray,et al.  Image Classification by a Two-Dimensional Hidden , 2000 .

[3]  Sean R. Eddy,et al.  Multiple Alignment Using Hidden Markov Models , 1995, ISMB.

[4]  øöö Blockinøø Well-Trained PETs : Improving Probability Estimation , 2000 .

[5]  Monson H. Hayes,et al.  Hidden Markov models for face recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Gautam Biswas,et al.  Temporal Pattern Generation Using Hidden Markov Model Based Unsupervised Classification , 1999, IDA.

[7]  Fu-Rong Jean,et al.  Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate , 2013, IEEE Transactions on Cybernetics.

[8]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[9]  Padhraic Smyth,et al.  A general probabilistic framework for clustering individuals and objects , 2000, KDD '00.

[10]  Qing-Bin Gao,et al.  Human activity recognition with beta process hidden Markov models , 2013, 2013 International Conference on Machine Learning and Cybernetics.

[11]  James T. Kwok,et al.  Rival penalized competitive learning for model-based sequence clustering , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12]  Brian C. Lovell,et al.  Improved Ensemble Training for Hidden Markov Models using Random Relative Node Permutations , 2003 .

[13]  Abdolreza Mirzaei,et al.  Multiple Observations HMM Learning by Aggregating Ensemble Models , 2013, IEEE Transactions on Signal Processing.

[14]  Biing-Hwang Juang,et al.  HMM clustering for connected word recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  Behrouz Homayoun Far,et al.  Calculating the strength of ties of a social network in a semantic search system using hidden Markov models , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[16]  Charles X. Ling,et al.  AUC: A Better Measure than Accuracy in Comparing Learning Algorithms , 2003, Canadian Conference on AI.

[17]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[18]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[19]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[20]  Cen Li,et al.  A bayesian approach to temporal data clustering using the hidden markov model methodology , 2000 .

[21]  Nianjun Liu,et al.  Gesture Classification Using Hidden Markov Models and Viterbi Path Counting , 2003, DICTA.

[22]  Brian C. Lovell,et al.  Improved Classification Using Hidden Markov Averaging From Multiple Observation Sequences , 2002 .

[23]  Geir Hovland,et al.  Hidden Markov Models as a Process Monitor in Robotic Assembly , 1998, Int. J. Robotics Res..

[24]  Brian C. Lovell,et al.  Improved estimation of hidden Markov model parameters from multiple observation sequences , 2002, Object recognition supported by user interaction for service robots.

[25]  Guo Hongwei,et al.  Time Series Clustering Based on Different Width Time Window Partition , 2007 .

[26]  B. Juang,et al.  Context-dependent Phonetic Hidden Markov Models for Speaker-independent Continuous Speech Recognition , 2008 .

[27]  Flávio Bortolozzi,et al.  A comparison of SVM and HMM classifiers in the off-line signature verification , 2005, Pattern Recognit. Lett..

[28]  Gautam Biswas,et al.  Matryoshka: A HMM based temporal data clustering methodology for modeling system dynamics , 2002, Intell. Data Anal..

[29]  Andreas Stolcke,et al.  Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.

[30]  L. Baum,et al.  Growth transformations for functions on manifolds. , 1968 .

[31]  Charles X. Ling,et al.  Toward Bayesian Classifiers with Accurate Probabilities , 2002, PAKDD.

[32]  Robert M. Gray,et al.  Image classification by a two-dimensional hidden Markov model , 2000, IEEE Trans. Signal Process..

[33]  Chien-Liang Liu,et al.  An HMM-Based Algorithm for Content Ranking and Coherence-Feature Extraction , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  Padhraic Smyth,et al.  Clustering Sequences with Hidden Markov Models , 1996, NIPS.

[35]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[36]  Cranos M. Williams,et al.  An HMM approach to identify components that influence phenotypes , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[37]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[38]  Gautam Biswas,et al.  A Bayesian Approach to Temporal Data Clustering using Hidden Markov Models , 2000, ICML.

[39]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[40]  Gautam Biswas,et al.  Clustering sequence data using hidden Markov model representation , 1999, Defense, Security, and Sensing.

[41]  Zhenyu He,et al.  Writer identification of Chinese handwriting documents using hidden Markov tree model , 2008, Pattern Recognit..

[42]  Cen Li,et al.  Applying the Hidden Markov Model Methodology for Unsupervised Learning of Temporal Data , 2002 .

[43]  Wei Wang,et al.  A prediction algorithm for time series based on adaptive model selection , 2009, Expert Syst. Appl..

[44]  Mário A. T. Figueiredo,et al.  Similarity-Based Clustering of Sequences Using Hidden Markov Models , 2003, MLDM.

[45]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[46]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[47]  Manuele Bicego,et al.  A Hidden Markov Model-Based Approach to Sequential Data Clustering , 2002, SSPR/SPR.

[48]  Kai-Fu Lee,et al.  Context-independent phonetic hidden Markov models for speaker-independent continuous speech recognition , 1990 .