Pattern recognition for command and control data systems

To analyze real-world events, researchers collect observation data from an underlying process and construct models to represent the observed situation. In this work, we consider issues that affect the construction and use of one specific type of model. Markov models are commonly used because their combination of discrete states and stochastic transitions suits applications with both deterministic and stochastic components. Hidden Markov Models (HMMs) are a class of Markov model commonly used in pattern recognition. We first demonstrate how to construct HMMs using only the observation data, with no a priori information, by extending a previously developed approach from J.P. Crutchfield and C.R. Shalizi. We also show how to determine, with a stated level of statistical confidence, whether the model fully encapsulates the underlying process. Once models are constructed from observation data, they are used to identify other types of observations. Traditional approaches select the model with the maximum likelihood of matching the observation, which solves a classification problem. We present a new method using confidence intervals and receiver operating characteristic curves. Our method solves a detection problem by determining whether the observation data matches zero, one, or more than one model. To detect the occurrence of a behavior in observation data, one must consider the amount of data required. We call behaviors "serial Markovian" when the behavior can change from one model to another at any time. When analyzing observation data, using too much data introduces a long delay and can confuse the system if multiple behaviors appear in the data stream. If too little data is used, the system has a high false positive rate and cannot correctly detect behaviors. We demonstrate the effectiveness of all methods using illustrative examples and consumer behavior data.
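
The difference between detection and classification can be illustrated with a minimal sketch. It is not the method developed in this work; it assumes, for simplicity, that each model is a first-order Markov chain whose states are the observed symbols, and it uses a Wilson score interval per transition without any correction for multiple comparisons. The names `wilson_interval`, `matches`, and `detect`, and the three example models, are hypothetical.

```python
import math
from collections import Counter

EPS = 1e-9  # tolerance for floating-point round-off at the interval ends


def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    if trials == 0:
        return (0.0, 1.0)  # no data: every probability is still plausible
    p_hat = successes / trials
    denom = 1 + z * z / trials
    center = (p_hat + z * z / (2 * trials)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / trials
                         + z * z / (4 * trials * trials)) / denom
    return (max(0.0, center - half), min(1.0, center + half))


def matches(model, symbols, z=1.96):
    """Accept the model if each of its transition probabilities lies inside
    the confidence interval estimated from the observed symbol sequence.

    model: dict mapping state -> dict mapping next state -> probability
           (assumed representation, states are the observed symbols).
    """
    pair_counts = Counter(zip(symbols, symbols[1:]))
    state_counts = Counter(symbols[:-1])
    # Reject if the data contains a transition the model does not allow.
    for (state, nxt) in pair_counts:
        if model.get(state, {}).get(nxt, 0.0) == 0.0:
            return False
    for state, row in model.items():
        n = state_counts[state]
        for nxt, p in row.items():
            lo, hi = wilson_interval(pair_counts[(state, nxt)], n, z)
            if not (lo - EPS <= p <= hi + EPS):
                return False
    return True


def detect(models, symbols, z=1.96):
    """Return the names of all models consistent with the data.
    The result may hold zero, one, or several names."""
    return [name for name, model in models.items() if matches(model, symbols, z)]


# Hypothetical example models over the symbols 'a' and 'b'.
models = {
    "A": {"a": {"a": 0.5, "b": 0.5}, "b": {"a": 0.5, "b": 0.5}},
    "B": {"a": {"b": 1.0}, "b": {"a": 1.0}},
    "C": {"a": {"a": 0.55, "b": 0.45}, "b": {"a": 0.45, "b": 0.55}},
}

print(detect(models, "ab" * 50))    # ['B']: exactly one model matches
print(detect(models, "a" * 100))    # []: no model matches
print(detect(models, "aabb" * 25))  # ['A', 'C']: more than one model matches
```

A maximum-likelihood classifier would return one model in all three cases, including the second, where none of the models plausibly generated the data. The length of `symbols` plays the role of the data window discussed above: a longer window narrows the intervals but delays the decision.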
