Generalized Alignment-Based Trace Clustering of Process Behavior

Process mining techniques use event logs containing real process executions in order to mine, align and extend process models. The partition of an event log into trace variants facilitates the understanding and analysis of traces, so it is a common pre-processing in process mining environments. Trace clustering automates this partition; traditionally it has been applied without taking into consideration the availability of a process model. In this paper we extend our previous work on process model based trace clustering, by allowing cluster centroids to have a complex structure, that can range from a partial order, down to a subnet of the initial process model. This way, the new clustering framework presented in this paper is able to cluster together traces that are distant only due to concurrency or loop constructs in process models. We show the complexity analysis of the different instantiations of the trace clustering framework, and have implemented it in a prototype tool that has been tested on different datasets.

[1]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[2]  Bart Baesens,et al.  Active Trace Clustering for Improved Process Discovery , 2013, IEEE Transactions on Knowledge and Data Engineering.

[3]  Luigi Pontieri,et al.  Discovering expressive process models by clustering log traces , 2006, IEEE Transactions on Knowledge and Data Engineering.

[4]  Josep Carmona,et al.  Conformance Checking , 2018, Springer International Publishing.

[5]  Joost Engelfriet,et al.  Branching processes of Petri nets , 1991, Acta Informatica.

[6]  Michael Benedikt,et al.  Regular Repair of Specifications , 2011, 2011 IEEE 26th Annual Symposium on Logic in Computer Science.

[7]  Josep Carmona,et al.  POD - A Tool For Process Discovery Using Partial Orders and Independence Information , 2015, BPM.

[8]  Dirk Fahland,et al.  Detecting Deviating Behaviors Without Models , 2015, Business Process Management Workshops.

[9]  Wil M. P. van der Aalst,et al.  Context Aware Trace Clustering: Towards Improving Process Mining Results , 2009, SDM.

[10]  Wil M. P. van der Aalst,et al.  Process Mining , 2016, Springer Berlin Heidelberg.

[11]  Josep Carmona,et al.  Alignment-Based Trace Clustering , 2017, ER.

[12]  Jens Palsberg,et al.  Complexity Results for 1-safe Nets , 1993, FSTTCS.

[13]  Marielba Zacarias,et al.  Approaching Process Mining with Sequence Clustering: Experiments and Findings , 2007, BPM.

[14]  Josep Carmona,et al.  Unfolding-Based Process Discovery , 2015, ATVA.

[15]  Slawomir Lasota,et al.  The Reachability Problem for Petri Nets Is Not Elementary , 2018, J. ACM.

[16]  van der Wmp Wil Aalst,et al.  Discovering deviating cases and process variants using trace clustering , 2015 .

[17]  Niklas Sörensson,et al.  Translating Pseudo-Boolean Constraints into SAT , 2006, J. Satisf. Boolean Model. Comput..

[18]  Iain A. Stewart Reachability in Some Classes of Acyclic Petri Nets , 1995, Fundam. Informaticae.

[19]  Jordi Cortadella,et al.  Mining structured petri nets for the visualization of process behavior , 2016, SAC.

[20]  Teofilo F. GONZALEZ,et al.  Clustering to Minimize the Maximum Intercluster Distance , 1985, Theor. Comput. Sci..

[21]  Dirk Fahland,et al.  Conformance Checking Based on Partially Ordered Event Data , 2014, Business Process Management Workshops.

[22]  Josep Carmona,et al.  Model and Event Log Reductions to Boost the Computation of Alignments , 2016, SIMPDA.

[23]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.

[24]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[25]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[26]  Wil M. P. van der Aalst,et al.  Trace Clustering in Process Mining , 2008, Business Process Management Workshops.

[27]  Wil M. P. van der Aalst,et al.  Trace Clustering Based on Conserved Patterns: Towards Achieving Better Process Models , 2009, Business Process Management Workshops.

[28]  Hongyan Ma,et al.  Process-aware information systems: Bridging people and software through process technology , 2007, J. Assoc. Inf. Sci. Technol..

[29]  Jordi Cortadella,et al.  Process Windows , 2017, 2017 17th International Conference on Application of Concurrency to System Design (ACSD).