RoSE: Reoccurring Structures Detection in BPMN 2.0 Process Model Collections

The detection of structural similarities of process models is frequently discussed in the literature. The state-of-the-art approaches for structural similarities of process models presume a known subgraph that is searched in a larger graph, and utilize behavioral and textual semantics to achieve their goal. In this paper we propose an approach to detect reoccurring structures in a collection of BPMN 2.0 process models, without the knowledge of a subgraph to be searched, and by focusing solely on the structural characteristics of the process models. The proposed approach deals with the problems of subgraph isomorphism, frequent pattern discovery and maximum common subgraph isomorphism, which are mentioned as NP-hard in the literature. In this work we present a formal model and a novel algorithm for the detection of reoccurring structures in a collection of BPMN 2.0 process models. We then apply the algorithm to a collection of 1,806 real-world process models and provide a quantitative and qualitative analysis of the results.

[1]  Wil M. P. van der Aalst,et al.  Workflow Patterns , 2003, Distributed and Parallel Databases.

[2]  Frank Leymann,et al.  Integrating Compliance into Business Processes: Process Fragments as Reusable Compliance Controls , 2010 .

[3]  Andrea Burattin,et al.  Process Mining Techniques in Business Environments , 2015, Lecture Notes in Business Information Processing.

[4]  Wineke A. M. van Lent,et al.  Similarity of business process models : metrics and evaluation , 2009 .

[5]  Jussi Vanhatalo,et al.  Simplified Computation and Generalization of the Refined Process Structure Tree , 2010, WS-FM.

[6]  Cesare Pautasso,et al.  On the Road to Benchmarking BPMN 2.0 Workflow Engines , 2015, ICPE.

[7]  G. Valiente,et al.  An algorithm for graph pattern-matching , 1997 .

[8]  Wil M. P. van der Aalst,et al.  Workflow Patterns , 2004, Distributed and Parallel Databases.

[9]  Pit Pietsch,et al.  Comparison of BPMN2 Diagrams , 2012, BPMN.

[10]  Marlon Dumas,et al.  Approximate Clone Detection in Repositories of Business Process Models , 2012, BPM.

[11]  Remco M. Dijkman,et al.  A Short Survey on Process Model Similarity , 2013, Seminal Contributions to Information Systems Engineering.

[12]  Paul W. P. J. Grefen,et al.  Generating Synthetic Process Model Collections with Properties of Labeled Real-Life Models , 2014, AP-BPM.

[13]  Jan Mendling,et al.  Enabling Reuse of Process Models through the Detection of Similar Process Parts , 2012, Business Process Management Workshops.

[14]  Gabriel Valiente,et al.  Algorithms on Trees and Graphs , 2002, Springer Berlin Heidelberg.

[15]  Jan Recker,et al.  How Much Language Is Enough? Theoretical and Practical Use of the Business Process Modeling Notation , 2008, CAiSE.

[16]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[17]  Matjaz B. Juric,et al.  An Empirical Analysis of Business Process Execution Language Usage , 2014, IEEE Transactions on Software Engineering.

[18]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[19]  Remco M. Dijkman,et al.  APROMORE: An advanced process model repository , 2011, Expert Syst. Appl..

[20]  Marie-Christine Fauvet,et al.  Fragment-Based Version Management for Repositories of Business Process Models , 2011, OTM Conferences.

[21]  Sergey Ivanov,et al.  BPMNDiffViz: A Tool for BPMN Models Comparison , 2015, BPM.

[22]  Arvind Gupta,et al.  The Complexity of Subgraph Isomorphism Duality Results for Graphs of Bounded Path and Tree Width , 1995 .

[23]  Patrick Delfmann,et al.  Graph theory and model collection management: conceptual framework and runtime analysis of selected graph algorithms , 2014, Information Systems and e-Business Management.

[24]  Joost N. Kok,et al.  The Gaston Tool for Frequent Subgraph Mining , 2005, GraBaTs.

[25]  Mark von Rosing,et al.  Business Process Model and Notation - BPMN , 2015, The Complete Business Process Handbook, Vol. I.

[26]  Cesare Pautasso,et al.  A Container-centric Methodology for Benchmarking Workflow Management Systems , 2016, CLOSER.

[27]  Paul W. P. J. Grefen,et al.  Generating process model collections , 2015, Software & Systems Modeling.

[28]  Frank Leymann,et al.  Application of Sub-Graph Isomorphism to Extract Reoccurring Structures from BPMN 2.0 Process Models , 2015, 2015 IEEE Symposium on Service-Oriented System Engineering.

[29]  Dror G. Feitelson,et al.  Workload Modeling for Computer Systems Performance Evaluation , 2015 .

[30]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Marlon Dumas,et al.  Fast detection of exact clones in business process model repositories , 2013, Inf. Syst..