Designing and Evaluating an Interpretable Predictive Modeling Technique for Business Processes

Process mining is a field traditionally concerned with retrospective analysis of event logs, yet interest in applying it online to running process instances is increasing. In this paper, we design a predictive modeling technique that can be used to quantify probabilities of how a running process instance will behave based on the events that have been observed so far. To this end, we study the field of grammatical inference and identify suitable probabilistic modeling techniques for event log data. After tailoring one of these techniques to the domain of business process management, we derive a learning algorithm. By combining our predictive model with an established process discovery technique, we are able to visualize the significant parts of predictive models in form of Petri nets. A preliminary evaluation demonstrates the effectiveness of our approach.

[1]  Jcam Joos Buijs,et al.  Mining process performance from event logs : the BPI Challenge 2012 case study , 2012 .

[2]  Luciano Lavagno,et al.  Petrify: A Tool for Manipulating Concurrent Specifications and Synthesis of Asynchronous Controllers (Special Issue on Asynchronous Circuit and System Design) , 1997 .

[3]  J. Franklin,et al.  The elements of statistical learning: data mining, inference and prediction , 2005 .

[4]  Colin de la Higuera,et al.  Results of the PAutomaC Probabilistic Automaton Learning Competition , 2012, ICGI.

[5]  Colin de la Higuera,et al.  A bibliographical study of grammatical inference , 2005, Pattern Recognit..

[6]  David Barber,et al.  Bayesian reasoning and machine learning , 2012 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Mans Hulden Treba: Efficient Numerically Stable EM for PFA , 2012, ICGI.

[9]  Wil M. P. van der Aalst,et al.  Process Discovery: Capturing the Invisible , 2010, IEEE Comput. Intell. Mag..

[10]  B. F. van Dongen BPI Challenge 2015 , 2015 .

[11]  Vojtech Huser,et al.  Process Mining: Discovery, Conformance and Enhancement of Business Processes , 2012, J. Biomed. Informatics.

[12]  Colin de la Higuera Grammatical Inference: Informed learners , 2010 .

[13]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[14]  Olivier Teytaud,et al.  Intelligent Agents for the Game of Go , 2010, IEEE Computational Intelligence Magazine.

[15]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[16]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[17]  Galit Shmueli,et al.  Predictive Analytics in Information Systems Research , 2010, MIS Q..

[18]  Wil M. P. van der Aalst,et al.  Time prediction based on process mining , 2011, Inf. Syst..

[19]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[20]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[21]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[22]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[23]  Tommi S. Jaakkola,et al.  On the Dirichlet Prior and Bayesian Regularization , 2002, NIPS.

[24]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[25]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[26]  Arya Adriansyah,et al.  Mining Process Performance from Event Logs , 2012, Business Process Management Workshops.

[27]  Christopher M. Bishop,et al.  A New Framework for Machine Learning , 2008, WCCI.

[28]  Ryo Yoshinaka,et al.  Marginalizing Out Transition Probabilities for Several Subclasses of PFAs , 2012, ICGI.

[29]  Francisco Casacuberta,et al.  Probabilistic finite-state machines - part I , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Wil M. P. van der Aalst,et al.  Beyond Process Mining: From the Past to Present and Future , 2010, CAiSE.

[31]  Jae-Yoon Jung,et al.  Constructing Decision Trees from Process Logs for Performer Recommendation , 2013, Business Process Management Workshops.

[32]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[33]  C. D. L. Higuera,et al.  PAutomaC : a PFA / HMM Learning Competition , 2012 .

[34]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.