Metaheuristic Optimization for Automated Business Process Discovery

The problem of automated discovery of process models from event logs has been intensely investigated in the past two decades, leading to a range of approaches that strike various trade-offs between accuracy, model complexity, and execution time. A few studies have suggested that the accuracy of automated process discovery approaches can be enhanced by using metaheuristic optimization. However, these studies have remained at the level of proposals without validation on real-life logs or they have only considered one metaheuristics in isolation. In this setting, this paper studies the following question: To what extent can the accuracy of automated process discovery approaches be improved by applying different optimization metaheuristics? To address this question, the paper proposes an approach to enhance automated process discovery approaches with metaheuristic optimization. The approach is instantiated to define an extension of a state-of-the-art automated process discovery approach, namely Split Miner. The paper compares the accuracy gains yielded by four optimization metaheuristics relative to each other and relative to state-of-the-art baselines, on a benchmark comprising 20 real-life logs. The results show that metaheuristic optimization improves the accuracy of Split Miner in a majority of cases, at the cost of execution times in the order of minutes, versus seconds for the base algorithm.

[1]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[2]  Josep Carmona,et al.  A Method for Assessing Parameter Impact on Control-Flow Discovery Algorithms , 2015, ATAED@Petri Nets/ACSD.

[3]  Sander J. J. Leemans,et al.  Discovering Block-Structured Process Models from Event Logs Containing Infrequent Behaviour , 2013, Business Process Management Workshops.

[4]  Wei Song,et al.  Business Process Mining Based on Simulated Annealing , 2008, 2008 The 9th International Conference for Young Computer Scientists.

[5]  Fred W. Glover,et al.  Future paths for integer programming and links to artificial intelligence , 1986, Comput. Oper. Res..

[6]  Thomas Stützle,et al.  Local search algorithms for combinatorial problems - analysis, improvements, and new applications , 1999, DISKI.

[7]  Marlon Dumas,et al.  Abstract-and-Compare: A Family of Scalable Precision Measures for Automated Process Discovery , 2018, BPM.

[8]  Seppe K. L. M. vanden Broucke,et al.  Fodina: A robust and flexible heuristic process discovery technique , 2017, Decis. Support Syst..

[9]  C. B. Pop,et al.  Hybrid Particle Swarm Optimization method for process mining , 2012, 2012 IEEE 8th International Conference on Intelligent Computer Communication and Processing.

[10]  Boudewijn F. van Dongen,et al.  Conformance Checking Using Cost-Based Fitness Analysis , 2011, 2011 IEEE 15th International Enterprise Distributed Object Computing Conference.

[11]  Somayeh Alizadeh,et al.  ICMA: a new efficient algorithm for process model discovery , 2018, Applied Intelligence.

[12]  Sander J. J. Leemans,et al.  Scalable process discovery and conformance checking , 2016, Software & Systems Modeling.

[13]  Alessandro Sperduti,et al.  Automatic determination of parameters' values for Heuristics Miner++ , 2010, IEEE Congress on Evolutionary Computation.

[14]  Boudewijn F. van Dongen,et al.  Measuring precision of modeled behavior , 2015, Inf. Syst. E Bus. Manag..

[15]  Marlon Dumas,et al.  Automated discovery of structured process models from event logs: The discover-and-structure approach , 2017, Data Knowl. Eng..

[16]  Patrick Siarry,et al.  A survey on optimization metaheuristics , 2013, Inf. Sci..

[17]  Marlon Dumas,et al.  Measuring Fitness and Precision of Automatically Discovered Process Models: A Principled and Scalable Approach , 2022, IEEE Transactions on Knowledge and Data Engineering.

[18]  A. J. M. M. Weijters,et al.  Flexible Heuristics Miner (FHM) , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[19]  Marlon Dumas,et al.  Split miner: automated discovery of accurate and simple business process models from event logs , 2019, Knowledge and Information Systems.

[20]  Qiang Liu,et al.  An improved simulated annealing algorithm for process mining , 2009, 2009 13th International Conference on Computer Supported Cooperative Work in Design.

[21]  Wil M.P. van der Aalst,et al.  Genetic Process Mining , 2005, ICATPN.

[22]  Massimo Mecella,et al.  Automated Discovery of Process Models from Event Logs: Review and Benchmark , 2017, IEEE Transactions on Knowledge and Data Engineering.

[23]  Arthur H. M. ter Hofstede,et al.  Filtering Out Infrequent Behavior from Business Process Event Logs , 2017, IEEE Transactions on Knowledge and Data Engineering.

[24]  Boudewijn F. van Dongen,et al.  On the Role of Fitness, Precision, Generalization and Simplicity in Process Discovery , 2012, OTM Conferences.

[25]  C. Humby,et al.  Process Mining: Data science in Action , 2014 .