Efficient Selection of Process Mining Algorithms

While many process mining algorithms have been proposed recently, there does not exist a widely accepted benchmark to evaluate and compare these process mining algorithms. As a result, it can be difficult to choose a suitable process mining algorithm for a given enterprise or application domain. Some recent benchmark systems have been developed and proposed to address this issue. However, evaluating available process mining algorithms against a large set of business models (e.g., in a large enterprise) can be computationally expensive, tedious, and time-consuming. This paper investigates a scalable solution that can evaluate, compare, and rank these process mining algorithms efficiently, and hence proposes a novel framework that can efficiently select the process mining algorithms that are most suitable for a given model set. In particular, using our framework, only a portion of process models need empirical evaluation and others can be recommended directly via a regression model. As a further optimization, this paper also proposes a metric and technique to select high-quality reference models to derive an effective regression model. Experiments using artificial and real data sets show that our approach is practical and outperforms the traditional approach.

[1]  Jianmin Wang,et al.  A Behavioral Similarity Measure between Labeled Petri Nets Based on Principal Transition Sequences - (Short Paper) , 2010, OTM Conferences.

[2]  Jan Mendling,et al.  Detection and prediction of errors in EPCs of the SAP reference model , 2008, Data Knowl. Eng..

[3]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[4]  A. Jefferson Offutt,et al.  An Empirical Evaluation , 1994 .

[5]  Wil M. P. van der Aalst,et al.  Genetic process mining: an experimental evaluation , 2007, Data Mining and Knowledge Discovery.

[6]  Remco M. Dijkman,et al.  Graph Matching Algorithms for Business Process Model Similarity Search , 2009, BPM.

[7]  Jan Mendling,et al.  Metrics for Process Models: Empirical Foundations of Verification, Error Prediction, and Guidelines for Correctness , 2008, Lecture Notes in Business Information Processing.

[8]  Boudewijn F. van Dongen,et al.  Workflow mining: A survey of issues and approaches , 2003, Data Knowl. Eng..

[9]  Wil M. P. van der Aalst,et al.  The Need for a Process Mining Evaluation Framework in Research and Practice , 2007, Business Process Management Workshops.

[10]  Jan Mendling,et al.  Quality metrics for business process models , 2007 .

[11]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.

[12]  Jianmin Wang,et al.  A workflow net similarity measure based on transition adjacency relations , 2010, Comput. Ind..

[13]  Liang-Jie Zhang,et al.  Development of Distance Measures for Process Mining, Discovery and Integration , 2007, Int. J. Web Serv. Res..

[14]  Jianmin Wang,et al.  Mining process models with non-free-choice constructs , 2007, Data Mining and Knowledge Discovery.

[15]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[16]  Jan Mendling,et al.  Testing Density as a Complexity Metric for EPCs , 2006 .

[17]  Jianmin Wang,et al.  An empirical evaluation of process mining algorithms based on structural and behavioral similarities , 2012, SAC '12.

[18]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[19]  Cw Christian Günther,et al.  Towards an evaluation framework for process mining algorithms , 2007 .

[20]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[21]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[22]  Boudewijn F. van Dongen,et al.  The ProM Framework: A New Era in Process Mining Tool Support , 2005, ICATPN.

[23]  Jan Mendling,et al.  What Makes Process Models Understandable? , 2007, BPM.

[24]  Josep Carmona,et al.  A Region-Based Algorithm for Discovering Petri Nets from Event Logs , 2008, BPM.

[25]  Jörg Desel,et al.  ''What Is a Petri Net?'' , 2001, Unifying Petri Nets.

[26]  Remco M. Dijkman,et al.  Similarity Search of Business Process Models , 2009, IEEE Data Eng. Bull..

[27]  Jan Mendling,et al.  Faulty EPCs in the SAP Reference Model , 2006, Business Process Management.

[28]  Jianmin Wang,et al.  On Recommendation of Process Mining Algorithms , 2012, 2012 IEEE 19th International Conference on Web Services.

[29]  Ralf Laue,et al.  Analysing Differences between Business Process Similarity Measures , 2011, Business Process Management Workshops.

[30]  Tao Jin,et al.  Efficiently Querying Business Process Models with BeehiveZ , 2011, BPM.

[31]  Remco M. Dijkman,et al.  Similarity of business process models: Metrics and evaluation , 2011, Inf. Syst..

[32]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[33]  Wil M.P. van der Aalst,et al.  Process mining with the HeuristicsMiner algorithm , 2006 .