A Framework for Efficient Memory Utilization in Online Conformance Checking

Conformance checking (CC) techniques of the processmining field gauge the conformance of the sequence of events in a case with respect to a business process model, which simply put is an amalgam of certain behavioral relations or rules. Online conformance checking (OCC) techniques are tailored for assessing such conformance on streaming events. The realistic assumption of having a finite memory for storing the streaming events has largely not been considered by the OCC techniques. We propose three incremental approaches to reduce the memory consumption in prefixalignment-based OCC techniques along with ensuring that we incur a minimum loss of the conformance insights. Our first proposed approach bounds the number of maximum states that constitute a prefix-alignment to be retained by any case in memory. The second proposed approach bounds the number of cases that are allowed to retain more than a single state, referred to as multistate cases. Building on top of the two proposed approaches, our third approach further bounds the number of maximum states that the multi-state cases can retain. All these approaches forget the states in excess to their defined limits and retain a meaningful summary of them. Computing prefix-alignments in the future is then resumed for such cases from the current position contained in the summary. We highlight the superiority of all proposed approaches compared to a state of the art prefix-alignment-based OCC technique through experiments using real-life event data under a streaming setting. Our approaches substantially reducememory consumption by up to 80% on average, while introducing a minor accuracy drop.

[1]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[2]  A Arya Adriansyah,et al.  Aligning observed and modeled behavior , 2014 .

[3]  Josep Carmona,et al.  A Framework for Online Conformance Checking , 2017, Business Process Management Workshops.

[4]  Marwan Hassani,et al.  Online conformance checking: relating event streams to process models using prefix-alignments , 2017, International Journal of Data Science and Analytics.

[5]  Wil M. P. van der Aalst,et al.  Recomposing conformance: Closing the circle on decomposed alignment-based conformance checking in process mining , 2018, Inf. Sci..

[6]  van der Wmp Wil Aalst,et al.  An iterative algorithm for applying the theory of regions in process mining , 2007 .

[7]  Boudewijn F. van Dongen,et al.  Cost-Based Fitness in Conformance Checking , 2011, 2011 Eleventh International Conference on Application of Concurrency to System Design.

[8]  S. J. van Zelst,et al.  Process mining with streaming data , 2019 .

[9]  Wil M. P. van der Aalst,et al.  Decomposing Petri nets for process mining: A generic approach , 2013, Distributed and Parallel Databases.

[10]  Wil M. P. van der Aalst,et al.  Data Science in Action , 2016 .

[11]  Boudewijn F. van Dongen,et al.  Controlling Break-the-Glass through Alignment , 2013, 2013 International Conference on Social Computing.

[12]  Andrea Burattin,et al.  Orientation and conformance: A HMM-based approach to online conformance checking , 2020, Inf. Syst..

[13]  Marwan Hassani,et al.  Prefix Imputation of Orphan Events in Event Stream Processing , 2021, Frontiers in Big Data.

[14]  Josep Carmona,et al.  Online Conformance Checking Using Behavioural Patterns , 2018, BPM.

[15]  S. Sakr,et al.  Streaming process discovery and conformance checking , 2018 .

[16]  Heitor Murilo Gomes,et al.  Data stream analysis: Foundations, major tasks and tools , 2021, WIREs Data Mining Knowl. Discov..

[17]  Wil M. P. van der Aalst,et al.  On the application of sequential pattern mining primitives to process discovery: Overview, outlook and opportunity identification , 2019, WIREs Data Mining Knowl. Discov..

[18]  Marwan Hassani,et al.  Efficient clustering of big data streams , 2015 .

[19]  João Gama,et al.  Machine learning for streaming data: state of the art, challenges, and opportunities , 2019, SKDD.