Uncovering the Relationship Between Event Log Characteristics and Process Discovery Techniques

The research field of process mining deals with the extraction of knowledge from event logs. Event logs consist of the recording of activities that took place in a certain business environment and as such, one of process mining’s main goals is to get an insight on the execution of business processes. Although a substantial effort has been put on developing techniques which are able to mine event logs accurately, it is still unclear how exactly characteristics of the latter influence a technique’s performance. In this paper, we provide a robust methodology of analysis and subsequently derive useful insights on the role of event log characteristics in process discovery tasks by means of an exhaustive comparative study.

[1]  Cw Christian Günther,et al.  Towards an evaluation framework for process mining algorithms , 2007 .

[2]  Wil M. P. van der Aalst,et al.  Genetic process mining: an experimental evaluation , 2007, Data Mining and Knowledge Discovery.

[3]  Jianmin Wang,et al.  Mining process models with non-free-choice constructs , 2007, Data Mining and Knowledge Discovery.

[4]  Boudewijn F. van Dongen,et al.  A genetic algorithm for discovering process trees , 2012, 2012 IEEE Congress on Evolutionary Computation.

[5]  Wil M.P. van der Aalst,et al.  Genetic Process Mining , 2005, ICATPN.

[6]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[7]  Boudewijn F. van Dongen,et al.  Replaying history on process models for conformance checking and performance analysis , 2012, WIREs Data Mining Knowl. Discov..

[8]  Bart Baesens,et al.  A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs , 2012, Inf. Syst..

[9]  Jianmin Wang,et al.  Efficient Selection of Process Mining Algorithms , 2013, IEEE Transactions on Services Computing.

[10]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[11]  Wil M. P. van der Aalst,et al.  The Need for a Process Mining Evaluation Framework in Research and Practice , 2007, Business Process Management Workshops.

[12]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[13]  A. J. M. M. Weijters,et al.  Flexible Heuristics Miner (FHM) , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[14]  S. Geisser,et al.  On methods in the analysis of profile data , 1959 .

[15]  Shaojie Tang,et al.  Aggregation capacity of wireless sensor networks: Extended network case , 2011, 2011 Proceedings IEEE INFOCOM.

[16]  Boudewijn F. van Dongen,et al.  On the Role of Fitness, Precision, Generalization and Simplicity in Process Discovery , 2012, OTM Conferences.

[17]  Boudewijn F. van Dongen,et al.  Causal Nets: A Modeling Language Tailored towards Process Discovery , 2011, CONCUR.

[18]  Luigi Pontieri,et al.  Discovering expressive process models by clustering log traces , 2006, IEEE Transactions on Knowledge and Data Engineering.

[19]  Bart Baesens,et al.  A robust F-measure for evaluating discovered process models , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[20]  Boudewijn F. van Dongen,et al.  Process Discovery using Integer Linear Programming , 2009, Fundam. Informaticae.

[21]  Behzad Bordbar,et al.  A framework for comparing process mining algorithms , 2011, 2011 IEEE GCC Conference and Exhibition (GCC).

[22]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[23]  J. Mauchly Significance Test for Sphericity of a Normal $n$-Variate Distribution , 1940 .

[24]  Boudewijn F. van Dongen,et al.  Alignment Based Precision Checking , 2012, Business Process Management Workshops.

[25]  Jan Mendling,et al.  Process Compliance Measurement Based on Behavioural Profiles , 2010, CAiSE.

[26]  Bart Baesens,et al.  Improved Artificial Negative Event Generation to Enhance Process Event Logs , 2012, CAiSE.

[27]  Boudewijn F. van Dongen,et al.  ProM: The Process Mining Toolkit , 2009, BPM.

[28]  Bart Baesens,et al.  A comprehensive benchmarking framework (CoBeFra) for conformance analysis between procedural process models and event logs in ProM , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[29]  Bart Baesens,et al.  Robust Process Discovery with Artificial Negative Events , 2009, J. Mach. Learn. Res..

[30]  Boudewijn F. van Dongen,et al.  Cost-Based Fitness in Conformance Checking , 2011, 2011 Eleventh International Conference on Application of Concurrency to System Design.

[31]  H. Huynh,et al.  Estimation of the Box Correction for Degrees of Freedom from Sample Data in Randomized Block and Split-Plot Designs , 1976 .

[32]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[33]  Jan Mendling,et al.  Prediction of Business Process Model Quality Based on Structural Metrics , 2010, ER.

[34]  Wil M. P. van der Aalst,et al.  Conformance checking of processes based on monitoring real behavior , 2008, Inf. Syst..

[35]  Jianmin Wang,et al.  On Recommendation of Process Mining Algorithms , 2012, 2012 IEEE 19th International Conference on Web Services.

[36]  Josep Carmona,et al.  A Fresh Look at Precision in Process Conformance , 2010, BPM.

[37]  van der Wmp Wil Aalst,et al.  Process Mining , 2005, Process-Aware Information Systems.