Generating process model collection with diverse label and structural features

The advancements in Business Process Management Systems (BPMS) have placed process models at the center of enterprise information systems. Due to the significant importance of process models, organizations are maintaining more and more process models to explicitly represent the flow of their business operations. The sheer number has led to the development of repositories to efficiently manage these collections. These repositories provide techniques for storing process models and searching relevant models against a given query process model. Searching involves comparing the query model with each source process model in the collection to compute the degree of similarity between query-source process model pair. While several techniques have been developed for that purpose, a direct comparison of these approaches have rarely been made. A key reason to that is, the absence of a freely available benchmark collection of process models that contains examples of process models with diverse features. To overcome that problem, we have employed a systematic and rigorous protocol to generate a diversified collection of process models, and compare it with the famous SAP's process model collection to establish the superiority of our collection. Further, we have applied a baseline approach to establish that the variants of process models (that we have generated) are significantly different from each other. It is pertinent to mention that our collection is freely available and we contend that the proposed collection will be useful in making a direct comparison of existing techniques and developing, evaluating and analyzing new techniques for process matching.

[1]  Jan Mendling,et al.  Metrics for Process Models: Empirical Foundations of Verification, Error Prediction, and Guidelines for Correctness , 2008, Lecture Notes in Business Information Processing.

[2]  Jan Mendling,et al.  Seven process modeling guidelines (7PMG) , 2010, Inf. Softw. Technol..

[3]  Monique Snoeck,et al.  Testing a Selection of BPMN Tools for Their Support of Modelling Guidelines , 2015, PoEM.

[4]  Kwanghyun Park,et al.  Corpora and Language Assessment: The State of the Art , 2014 .

[5]  Jan Mendling,et al.  Detection and prediction of errors in EPCs of the SAP reference model , 2008, Data Knowl. Eng..

[6]  Nicola Guarino,et al.  Business Process Management , 2016, Business & Information Systems Engineering.

[7]  Andrei Z. Broder,et al.  On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[8]  Ralf Laue,et al.  A comparative survey of business process similarity measures , 2012, Comput. Ind..

[9]  I.N. Bozkurt,et al.  Authorship attribution , 2007, 2007 22nd international symposium on computer and information sciences.

[10]  Yorick Wilks,et al.  The METER corpus : a corpus for analysing journalistic text reuse , 2001 .

[11]  Rama Akkiraju,et al.  Discovering Business Process Similarities: An Empirical Study with SAP Best Practice Business Processes , 2010, ICSOC.

[12]  Remco M. Dijkman,et al.  Similarity Search of Business Process Models , 2009, IEEE Data Eng. Bull..

[13]  Mark Stevenson,et al.  Developing a corpus of plagiarised short answers , 2011, Lang. Resour. Evaluation.

[14]  Daniela Grigori,et al.  Process Analytics - Concepts and Techniques for Querying and Analyzing Process Data , 2016 .

[15]  Paul W. P. J. Grefen,et al.  Generating Synthetic Process Model Collections with Properties of Labeled Real-Life Models , 2014, AP-BPM.

[16]  Mathias Weske,et al.  Semantic Querying of Business Process Models , 2008, 2008 12th International IEEE Enterprise Distributed Object Computing Conference.

[17]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[18]  Remco M. Dijkman,et al.  Similarity of business process models: Metrics and evaluation , 2011, Inf. Syst..

[19]  Paul W. P. J. Grefen,et al.  Business process model repositories:framework and survey , 2009 .

[20]  Jianmin Wang,et al.  Querying Process Models Based on the Temporal Relations between Tasks , 2011, 2011 IEEE 15th International Enterprise Distributed Object Computing Conference Workshops.

[21]  Mathias Weske,et al.  Querying process models by behavior inclusion , 2015, Software & Systems Modeling.

[22]  Chathura C. Ekanayake Consolidation of business process model collections , 2014 .

[23]  Birgit Korherr,et al.  Business Process Modelling: Languages, Goals, and Variabilities , 2008 .

[24]  Mathias Weske,et al.  Business Process Management: Concepts, Languages, Architectures , 2007 .

[25]  Paul W. P. J. Grefen,et al.  Fast business process similarity search , 2012, Distributed and Parallel Databases.

[26]  W. M. P. V. D. Aalsta,et al.  YAWL : yet another workflow language , 2015 .

[27]  Hector Garcia-Molina,et al.  Building a scalable and accurate copy detection mechanism , 1996, DL '96.

[28]  Paul W. P. J. Grefen,et al.  A Critical Evaluation and Framework of Business Process Improvement Methods , 2016, Bus. Inf. Syst. Eng..

[29]  Jan Mendling,et al.  Increasing Recall of Process Model Matching by Improved Activity Label Matching , 2013, BPM.

[30]  Paul W. P. J. Grefen,et al.  Generating process model collections , 2015, Software & Systems Modeling.

[31]  Wil M. P. van der Aalst,et al.  Process Equivalence: Comparing Two Process Models Based on Observed Behavior , 2006, Business Process Management.

[32]  Remco M. Dijkman,et al.  APROMORE: An advanced process model repository , 2011, Expert Syst. Appl..