Recommendation of Process Discovery Algorithms Through Event Log Classification

Process mining is concerned with the extraction of knowledge about business processes from information system logs. Process discovery algorithms are process mining techniques focused on discovering process models starting from event logs. The applicability and effectiveness of process discovery algorithms rely on features of event logs and process characteristics. Selecting a suitable algorithm for an event log is a tough task due to the variety of variables involved in this process. The traditional approaches use empirical assessment in order to recommend a suitable discovery algorithm. This is a time consuming and computationally expensive approach. The present paper evaluates the usefulness of an approach based on classification to recommend discovery algorithms. A knowledge base was constructed, based on features of event logs and process characteristics, in order to train the classifiers. Experimental results obtained with the classifiers evidence the usefulness of the proposal for recommendation of discovery algorithms.

[1]  Sander J. J. Leemans,et al.  Discovering Block-Structured Process Models from Event Logs - A Constructive Approach , 2013, Petri Nets.

[2]  Bart Baesens,et al.  Improved Artificial Negative Event Generation to Enhance Process Event Logs , 2012, CAiSE.

[3]  Boudewijn F. van Dongen,et al.  Process Mining: Overview and Outlook of Petri Net Discovery Algorithms , 2009, Trans. Petri Nets Other Model. Concurr..

[4]  Bart Baesens,et al.  A Critical Evaluation Study of Model-Log Metrics in Process Discovery , 2010, Business Process Management Workshops.

[5]  Wil M. P. van der Aalst,et al.  Transactions on Petri Nets and Other Models of Concurrency II, Special Issue on Concurrency in Process-Aware Information Systems , 2009, Trans. Petri Nets and Other Models of Concurrency.

[6]  Lulu Ma How to Evaluate the Performance of Process Discovery Algorithms , 2012 .

[7]  Boudewijn F. van Dongen,et al.  Process Discovery using Integer Linear Programming , 2009, Fundam. Informaticae.

[8]  Bart Baesens,et al.  Uncovering the Relationship Between Event Log Characteristics and Process Discovery Techniques , 2013, Business Process Management Workshops.

[9]  Bart Baesens,et al.  A comprehensive benchmarking framework (CoBeFra) for conformance analysis between procedural process models and event logs in ProM , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[10]  Stefanie Rinderle-Ma,et al.  Data Transformation and Semantic Log Purging for Process Mining , 2012, CAiSE.

[11]  Wil M. P. van der Aalst,et al.  Conformance Testing: Measuring the Fit and Appropriateness of Event Logs and Process Models , 2005, Business Process Management Workshops.

[12]  Wil M. P. van der Aalst Process mining , 2012, CACM.

[13]  Wil M. P. van der Aalst,et al.  Conformance checking of processes based on monitoring real behavior , 2008, Inf. Syst..

[14]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[15]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[16]  Bart Baesens,et al.  A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs , 2012, Inf. Syst..

[17]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[18]  Jianmin Wang,et al.  Efficient Selection of Process Mining Algorithms , 2013, IEEE Transactions on Services Computing.

[19]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[20]  Cw Christian Günther,et al.  Towards an evaluation framework for process mining algorithms , 2007 .

[21]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[22]  Wil M. P. van der Aalst,et al.  Genetic process mining: an experimental evaluation , 2007, Data Mining and Knowledge Discovery.

[23]  Alessandro Sperduti,et al.  PLG: A Framework for the Generation of Business Process Models and Their Execution Logs , 2010, Business Process Management Workshops.

[24]  Anuradha Bhamidipaty,et al.  Process Trace Identification from Unstructured Execution Logs , 2010, 2010 IEEE International Conference on Services Computing.

[25]  Boudewijn F. van Dongen,et al.  Alignment Based Precision Checking , 2012, Business Process Management Workshops.

[26]  Boudewijn F. van Dongen,et al.  XES, XESame, and ProM 6 , 2010, CAiSE Forum.