Process Discovery Using Classification Tree Hidden Semi-Markov Model

Various and ubiquitous information systems are being used in monitoring, exchanging, and collecting information. These systems are generating massive amount of event sequence logs that may help us understand underlying phenomenon. By analyzing these logs, we can learn process models that describe system procedures, predict the development of the system, or check whether the changes are expected. In this paper, we consider a novel technique that models these sequences of events in temporal-probabilistic manners. Specifically, we propose a probabilistic process model that combines hidden semi-Markov model and classification trees learning. Our experimental result shows that the proposed approach can answer a kind of question-"what are the most frequent sequence of system dynamics relevant to a given sequence of observable events?". For example, "Given a series of medical treatments, what are the most relevant patients' health condition pattern changes at different times?".

[1]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[2]  Wil M. P. van der Aalst,et al.  Process Aware Information Systems: Bridging People and Software Through Process Technology , 2005 .

[3]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[4]  Nikolaos Limnios,et al.  Semi-Markov Chains and Hidden Semi-Markov Models toward Applications: Their Use in Reliability and DNA Analysis , 2008 .

[5]  Christos G. Cassandras,et al.  Introduction to Discrete Event Systems , 1999, The Kluwer International Series on Discrete Event Dynamic Systems.

[6]  M. Anusha,et al.  Big Data-Survey , 2016 .

[7]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[8]  J. Sansom,et al.  Fitting hidden semi-Markov models to breakpoint rainfall data , 2001, Journal of Applied Probability.

[9]  Søren Højsgaard,et al.  Hidden Semi Markov Models for Multiple Observation Sequences: The mhsmm Package for R , 2011 .

[10]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[11]  Eamonn J. Keogh,et al.  A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering , 2005, PAKDD.

[12]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[13]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[14]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[15]  I K Fodor,et al.  A Survey of Dimension Reduction Techniques , 2002 .

[16]  Wil vanderAalst,et al.  Workflow Management: Models, Methods, and Systems , 2004 .

[17]  M. Ostendorf,et al.  Maximum likelihood successive state splitting , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[18]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[20]  Nir Friedman,et al.  Probabilistic Graphical Models , 2009, Data-Driven Computational Neuroscience.

[21]  E. LESTER SMITH,et al.  AND OTHERS , 2005 .

[22]  Vladimir Zadorozhny,et al.  Process monitoring using maximum sequence divergence , 2016, Knowledge and Information Systems.

[23]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[24]  T. Therneau,et al.  An Introduction to Recursive Partitioning Using the RPART Routines , 2015 .