Fusion-Based Process Discovery

Information systems record the execution of transactions as part of business processes in event logs. Process mining analyses such event logs, e.g., by discovering process models. Recently, various discovery algorithms have been proposed, each with specific advantages and limitations. In this work, we argue that, instead of relying on a single algorithm, the outcomes of different algorithms should be fused to combine the strengths of the individual approaches. We propose a general framework for such fusion and instantiate it with two new discovery algorithms: the Exhaustive Noise-aware Inductive Miner (exNoise), which exhaustively searches for model improvements; and the Adaptive Noise-aware Inductive Miner (adaNoise), a computationally tractable version of exNoise. For both algorithms, we formally show that they outperform each of the individual mining algorithms they use. Our empirical evaluation further illustrates that fusion-based discovery yields models of better quality than state-of-the-art approaches.
