Partial Order Resolution of Event Logs for Process Conformance Checking

Abstract While supporting the execution of business processes, information systems record event logs. Conformance checking relies on these logs to analyze whether the recorded behavior of a process conforms to the behavior of a normative specification. A key assumption of existing conformance checking techniques, however, is that all events are associated with timestamps that allow to infer a total order of events per process instance. Unfortunately, this assumption is often violated in practice. Due to synchronization issues, manual event recordings, or data corruption, events are only partially ordered. In this paper, we put forward the problem of partial order resolution of event logs to close this gap. It refers to the construction of a probability distribution over all possible total orders of events of an instance. To cope with the order uncertainty in real-world data, we present several estimators for this task, incorporating different notions of behavioral abstraction. Moreover, to reduce the runtime of conformance checking based on partial order resolution, we introduce an approximation method that comes with a bounded error in terms of accuracy. Our experiments with real-world and synthetic data reveal that our approach improves accuracy over the state-of-the-art considerably.

[1]  Neil Immerman,et al.  Recognizing patterns in streams with imprecise timestamps , 2010, Proc. VLDB Endow..

[2]  Hajo A. Reijers,et al.  Efficient Process Conformance Checking on the Basis of Uncertain Event-to-Activity Mappings , 2020, IEEE Transactions on Knowledge and Data Engineering.

[3]  Wil M. P. van der Aalst,et al.  Process Mining in Healthcare: Data Challenges When Answering Frequently Posed Questions , 2012, ProHealth/KR4HC.

[4]  Shazia Wasim Sadiq,et al.  Compliance Aware Business Process Design , 2007, Business Process Management Workshops.

[5]  Wil M. P. van der Aalst,et al.  Single-Entry Single-Exit decomposed conformance checking , 2014, Inf. Syst..

[6]  Yanlei Diao,et al.  Supporting Data Uncertainty in Array Databases , 2015, SIGMOD Conference.

[7]  A. H. M. T. Hofstedea,et al.  Event log imperfection patterns for process mining : Towards a systematic approach to cleaning event logs , 2016 .

[8]  Boudewijn F. van Dongen,et al.  Conformance Checking Using Cost-Based Fitness Analysis , 2011, 2011 IEEE 15th International Enterprise Distributed Object Computing Conference.

[9]  Boudewijn F. van Dongen Efficiently Computing Alignments - Using the Extended Marking Equation , 2018, BPM.

[10]  Thorsten Joachims,et al.  Playlist prediction via metric embedding , 2012, KDD.

[11]  Josep Carmona,et al.  A Recursive Paradigm for Aligning Observed Behavior of Large Structured Process Models , 2016, BPM.

[12]  Moe Thandar Wynn,et al.  Detection and Interactive Repair of Event Ordering Imperfection in Process Logs , 2018, CAiSE.

[13]  Benoît Depaire,et al.  Generating Artificial Data for Empirical Analysis of Control-flow Discovery Algorithms , 2019, Bus. Inf. Syst. Eng..

[14]  Mathias Weske,et al.  Bridging abstraction layers in process mining , 2014, Inf. Syst..

[15]  Christopher Ré,et al.  Event queries on correlated probabilistic streams , 2008, SIGMOD Conference.

[16]  Matthias Weidlich,et al.  Interval-based Queries over Lossy IoT Event Streams , 2020 .

[17]  Hajo A. Reijers,et al.  Checking process compliance against natural language specifications using behavioral spaces , 2018, Inf. Syst..

[18]  Akhil Kumar,et al.  Conceptual model for online auditing , 2011, Decis. Support Syst..

[19]  Mathias Weske,et al.  Prediction of Remaining Service Execution Time Using Stochastic Petri Nets with Arbitrary Firing Delays , 2013, ICSOC.

[20]  Philip S. Yu,et al.  A Survey of Uncertain Data Algorithms and Applications , 2009, IEEE Transactions on Knowledge and Data Engineering.

[21]  Marlon Dumas,et al.  Scalable Conformance Checking of Business Processes , 2017, OTM Conferences.

[22]  Wil M. P. van der Aalst,et al.  Wanna improve process mining results? , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[23]  Jan Mendling,et al.  The ROAD from Sensor Data to Process Instances via Interaction Mining , 2016, CAiSE.

[24]  Eric Koskinen,et al.  BorderPatrol: isolating events for black-box tracing , 2008, Eurosys '08.

[25]  M. Rosenblatt A CENTRAL LIMIT THEOREM AND A STRONG MIXING CONDITION. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[26]  A. Agresti,et al.  Approximate is Better than “Exact” for Interval Estimation of Binomial Proportions , 1998 .

[27]  Sander J. J. Leemans,et al.  Discovering Block-Structured Process Models from Event Logs - A Constructive Approach , 2013, Petri Nets.

[28]  Massimiliano de Leoni,et al.  How Planning Techniques Can Help Process Mining: The Conformance-Checking Case , 2017, SEBD.

[29]  Sharon L. Lohr,et al.  Sampling: Design and Analysis , 1999 .

[30]  Sander J. J. Leemans,et al.  Scalable process discovery and conformance checking , 2016, Software & Systems Modeling.

[31]  Michael Philippsen,et al.  Reliable speculative processing of out-of-order event streams in generic publish/subscribe middlewares , 2013, DEBS '13.

[32]  B. F. van Dongen BPI Challenge 2014 , 2014 .

[33]  Matthias Weidlich,et al.  Estimating Process Conformance by Trace Sampling and Result Approximation , 2019, BPM.

[34]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications) , 2006 .

[35]  Wil M. P. van der Aalst,et al.  Process Discovery and Conformance Checking Using Passages , 2014, Fundam. Informaticae.

[36]  Wil M. P. van der Aalst,et al.  Process Mining , 2016, Springer Berlin Heidelberg.

[37]  Bin Jiang,et al.  Probabilistic Skylines on Uncertain Data , 2007, VLDB.

[38]  Bart Baesens,et al.  Comprehensive rule-based compliance checking and risk management with process mining , 2013, Decis. Support Syst..

[39]  Wil M. P. van der Aalst,et al.  Event interval analysis: Why do processes take time? , 2015, Decis. Support Syst..

[40]  C. Humby,et al.  Process Mining: Data science in Action , 2014 .

[41]  Jian Pei,et al.  A brief survey on sequence classification , 2010, SKDD.

[42]  Anne Beaudry,et al.  Impacts of IT Acceptance and Resistance Behaviors: A Novel Framework , 2013, ICIS.

[43]  Josep Carmona,et al.  Conformance Checking - Relating Processes and Models , 2018 .

[44]  Fernando Pereira,et al.  Weighted finite-state transducers in speech recognition , 2002, Comput. Speech Lang..

[45]  Jennifer Widom,et al.  Making Aggregation Work in Uncertain and Probabilistic Databases , 2011, IEEE Transactions on Knowledge and Data Engineering.

[46]  Prashant J. Shenoy,et al.  Probabilistic Inference over RFID Streams in Mobile Environments , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[47]  Moe Thandar Wynn,et al.  Event log imperfection patterns for process mining: Towards a systematic approach to cleaning event logs , 2017, Inf. Syst..

[48]  Dirk Fahland,et al.  Conformance checking in healthcare based on partially ordered event data , 2014, Proceedings of the 2014 IEEE Emerging Technology and Factory Automation (ETFA).