Repairing Event Logs Using Stochastic Process Models

Companies strive to improve their business processes in order to remain competitive. Process mining aims to infer meaningful insights from process-related data and attracted the attention of practitioners, tool-vendors, and researchers in recent years. Traditionally, event logs are assumed to describe the as-is situation. But this is not necessarily the case in environments where logging may be compromised due to manual logging. For example, hospital staff may need to manually enter information regarding the patient’s treatment. As a result, events or timestamps may be missing or incorrect. In this report, we make use of process knowledge captured in process models, and provide a method to repair missing events in the logs. This way, we facilitate analysis of incomplete logs. We realize the repair by combining stochastic Petri nets, alignments, and Bayesian networks. We evaluate the results using both synthetic data and real event data from a Dutch hospital.

[1]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[2]  Marco Ajmone Marsan,et al.  A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems , 1984, TOCS.

[3]  Eike Best Structure Theory of Petri Nets: the Free Choice Hiatus , 1986 .

[4]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[5]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[6]  Marco Ajmone Marsan,et al.  The Effect of Execution Policies on the Semantics and Analysis of Stochastic Petri Nets , 1989, IEEE Trans. Software Eng..

[7]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[8]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[9]  Gianfranco Ciardo,et al.  A Characterization of the Stochastic Process Underlying a Stochastic Petri Net , 1994, IEEE Trans. Software Eng..

[10]  Wil M. P. van der Aalst,et al.  Verification of Workflow Nets , 1997, ICATPN.

[11]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[12]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[13]  Prakash P. Shenoy Inference in Hybrid Bayesian Networks Using Mixtures of Gaussians , 2006, UAI.

[14]  Yu Hen Hu,et al.  Statistical static timing analysis with conditional linear MAX/MIN approximation and extended canonical timing model , 2006, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[15]  Wil M.P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[16]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[17]  Boudewijn F. van Dongen,et al.  Conformance Checking Using Cost-Based Fitness Analysis , 2011, 2011 IEEE 15th International Enterprise Distributed Object Computing Conference.

[18]  Haiyang Hu,et al.  A Novel Approach for Mining Stochastic Process Model from Workflow Logs , 2011 .

[19]  William J. Knottenbelt,et al.  Deriving generalised stochastic Petri net performance models from high-precision location tracking data , 2011, VALUETOOLS.

[20]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[21]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[22]  Jan Vanthienen,et al.  IEEE Task force on process mining , 2011 .

[23]  Boudewijn F. van Dongen,et al.  Improving Business Process Models Using Observed Behavior , 2012, SIMPDA.

[24]  Mathias Weske,et al.  Embedding Conformance Checking in a Process Intelligence System in Hospital Environments , 2012, ProHealth/KR4HC.

[25]  Boudewijn F. van Dongen,et al.  Replaying history on process models for conformance checking and performance analysis , 2012, WIREs Data Mining Knowl. Discov..

[26]  Dirk Fahland,et al.  Repairing Process Models to Reflect Reality , 2012, BPM.

[27]  Mathias Weske,et al.  Discovering Stochastic Petri Nets with Arbitrary Delay Distributions from Event Logs , 2013, Business Process Management Workshops.

[28]  Piergiorgio Bertoli,et al.  Reasoning-Based Techniques for Dealing with Incomplete Business Process Execution Traces , 2013, AI*IA.