Correlation Miner: Mining Business Process Models and Event Correlations Without Case Identifiers

Process discovery algorithms aim to capture process models from event logs. These algorithms have been designed for logs in which the events that belong to the same case are related to each other — and to that case — by means of a unique case identifier. However, in service-oriented systems, these case identifiers are rarely stored beyond request-response pairs, which makes it hard to relate events that belong to the same case. This is known as the correlation challenge. This paper addresses the correlation challenge by introducing a technique, called the correlation miner, that facilitates discovery of business process models when events are not associated with a case identifier. It extends previous work on the correlation miner, by not only enabling the discovery of the process model, but also detecting which events belong to the same case. Experiments performed on both synthetic and real-world event logs show the applicability of the correlation miner. The resulting technique enables us to observe a service-oriented system and determine — with high accuracy — which request-response pairs sent by different communicating parties are related to each other.

[1]  Wil M. P. van der Aalst,et al.  Conformance checking of processes based on monitoring real behavior , 2008, Inf. Syst..

[2]  San-Yih Hwang,et al.  On the discovery of process models from their instances , 2002, Decis. Support Syst..

[3]  Jan Mendling,et al.  What Makes Process Models Understandable? , 2007, BPM.

[4]  Walid Gaaloul,et al.  Log-based mining techniques applied to Web service composition reengineering , 2008, Service Oriented Computing and Applications.

[5]  Wil M. P. van der Aalst,et al.  Challenges in Service Mining: Record, Check, Discover , 2013, ICWE.

[6]  Markus Hammori,et al.  Interactive Workflow Mining , 2004, Business Process Management.

[7]  Athman Bouguettaya,et al.  Service Mining on the Web , 2009, IEEE Transactions on Services Computing.

[8]  Marlon Dumas,et al.  Correlation Patterns in Service-Oriented Architectures , 2007, FASE.

[9]  Daniel Gillblad,et al.  Discovering Process Models from Unlabelled Event Logs , 2009, BPM.

[10]  Wil M. P. van der Aalst,et al.  Service Mining: Using Process Mining to Discover, Check, and Improve Service Behavior , 2013, IEEE Transactions on Services Computing.

[11]  Ashutosh Tiwari,et al.  A review of business process mining: state-of-the-art and future trends , 2008, Bus. Process. Manag. J..

[12]  Donald B. Johnson,et al.  Finding All the Elementary Circuits of a Directed Graph , 1975, SIAM J. Comput..

[13]  Wil M. P. van der Aalst,et al.  Genetic process mining: an experimental evaluation , 2007, Data Mining and Knowledge Discovery.

[14]  Jan Mendling,et al.  Case Construction for Mining Supply Chain Processes , 2009, BIS.

[15]  B. Efron Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods , 1981 .

[16]  Boudewijn F. van Dongen,et al.  The ProM Framework: A New Era in Process Mining Tool Support , 2005, ICATPN.

[17]  Schahram Dustdar,et al.  Discovering web service workflows using web services interaction mining , 2006, Int. J. Bus. Process. Integr. Manag..

[18]  Boudewijn F. van Dongen,et al.  Process Mining: Overview and Outlook of Petri Net Discovery Algorithms , 2009, Trans. Petri Nets Other Model. Concurr..

[19]  Paul W. P. J. Grefen,et al.  Correlation Mining: Mining Process Orchestrations Without Case Identifiers , 2015, ICSOC.

[20]  Luigi Pontieri,et al.  Mining Expressive Process Models by Clustering Workflow Traces , 2004, PAKDD.

[21]  Wil M. P. van der Aalst,et al.  Process Mining in Web Services: The WebSphere Case , 2008, IEEE Data Eng. Bull..

[22]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[23]  Fabio Casati,et al.  Event correlation for process discovery from web service interaction logs , 2011, The VLDB Journal.