On the refactoring of activity labels in business process models

Large corporations increasingly utilize business process models for documenting and redesigning their operations. The extent of such modeling initiatives with several hundred models and dozens of often hardly trained modelers calls for automated quality assurance. While formal properties of control flow can easily be checked by existing tools, there is a notable gap for checking the quality of the textual content of models, in particular, its activity labels. In this paper, we address the problem of activity label quality in business process models. We designed a technique for the recognition of labeling styles, and the automatic refactoring of labels with quality issues. More specifically, we developed a parsing algorithm that is able to deal with the shortness of activity labels, which integrates natural language tools like WordNet and the Stanford Parser. Using three business process model collections from practice with differing labeling style distributions, we demonstrate the applicability of our technique. In comparison to a straightforward application of standard natural language tools, our technique provides much more stable results. As an outcome, the technique shifts the boundary of process model quality issues that can be checked automatically from syntactic to semantic aspects.

[1]  Remco M. Dijkman,et al.  Merging Business Process Models , 2010, OTM Conferences.

[2]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[3]  Mathias Weske,et al.  Action Patterns in Business Process Models , 2009, ICSOC/ServiceWave.

[4]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[5]  Jan Mendling,et al.  Activity labeling in process modeling: Empirical insights and recommendations , 2010, Inf. Syst..

[6]  Jason Crampton,et al.  Delegation and satisfiability in workflow systems , 2008, SACMAT '08.

[7]  Robert W. Blanning,et al.  Synthesis and Decomposition of Processes in Organizations , 2003, Inf. Syst. Res..

[8]  Olivia R. Liu Sheng,et al.  Formulating the Data-Flow Perspective for Business Process Management , 2006, Inf. Syst. Res..

[9]  Jan Mendling,et al.  On the Automatic Labeling of Process Models , 2011, CAiSE.

[10]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[11]  Remco M. Dijkman,et al.  The ICoP Framework: Identification of Correspondences between Process Models , 2010, CAiSE.

[12]  Kevin Crowston,et al.  Organizing Business Knowledge: The MIT Process Handbook , 2003 .

[13]  Lionel C. Briand,et al.  An Automated Approach to Transform Use Cases into Activity Diagrams , 2010, ECMFA.

[14]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[15]  Flávia Maria Santoro,et al.  Business process mining from group stories , 2009, 2009 13th International Conference on Computer Supported Cooperative Work in Design.

[16]  John Krogstie,et al.  Process models representing knowledge for action: a revised quality framework , 2006, Eur. J. Inf. Syst..

[17]  Paolo Tonella,et al.  Supporting Ontology-Based Semantic Annotation of Business Processes with Automated Suggestions , 2010, Int. J. Inf. Syst. Model. Des..

[18]  Jan Mendling,et al.  Beyond soundness: on the verification of semantic business process models , 2010, Distributed and Parallel Databases.

[19]  Rob Davis BSc Business Process Modelling with ARIS: A Practical Guide , 2001, Springer London.

[20]  Jörg Becker,et al.  Towards increased comparability of conceptual models - Enforcing naming conventions through domain thesauri and linguistic grammars , 2009, ECIS.

[21]  Julius T. Tou,et al.  Information Systems , 1973, GI Jahrestagung.

[22]  Elisa Bertino,et al.  The specification and enforcement of authorization constraints in workflow management systems , 1999, TSEC.

[23]  Gretchen Hargis,et al.  Readability and computer documentation , 2000, AJCD.

[24]  Jörg Becker,et al.  Guidelines of Business Process Modeling , 2000, Business Process Management.

[25]  Akhil Kumar,et al.  XML - Based Schema Definition for Support of Interorganizational Workflow , 2003, Inf. Syst. Res..

[26]  Wil M. P. van der Aalst,et al.  Workflow Verification: Finding Control-Flow Errors Using Petri-Net-Based Techniques , 2000, Business Process Management.

[27]  Jan Mendling,et al.  Refactoring of Process Model Activity Labels , 2010, NLDB.

[28]  Nigel G. Ward Machine Translation: Past, Present, Future , 2001 .

[29]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[30]  Mark Strembeck,et al.  Influence Factors of Understanding Business Process Models , 2008, BIS.

[31]  Lawrence D. Miles,et al.  Techniques Of Value Analysis And Engineering , 1961 .

[32]  Jan Mendling,et al.  Empirical Studies in Process Model Verification , 2009, Trans. Petri Nets Other Model. Concurr..

[33]  Amit M. Paradkar,et al.  Use Cases to Process Specifications in Business Process Modeling Notation , 2010, 2010 IEEE International Conference on Web Services.

[34]  Mark Strembeck,et al.  Modeling process-related RBAC models with extended UML activity models , 2011, Inf. Softw. Technol..

[35]  Marc Ehrig,et al.  Measuring Similarity between Semantic Business Process Models , 2007, APCCM.

[36]  William J. Kettinger,et al.  Business Process Change: A Study of Methodologies, Techniques, and Tools , 1997, MIS Q..

[37]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[38]  Liang-Jie Zhang,et al.  Policy-Driven Process Mapping (PDPM): Discovering process models from business policies , 2009, Decis. Support Syst..

[39]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[40]  Jan Mendling,et al.  Recognising Activity Labeling Styles in Business Process Models , 2011, Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model..

[41]  James F. Allen Natural language processing , 1979 .

[42]  Jan Mendling,et al.  Metrics for Process Models: Empirical Foundations of Verification, Error Prediction, and Guidelines for Correctness , 2008, Lecture Notes in Business Information Processing.

[43]  Robert Dixon,et al.  Deriving verbs in English , 2008 .

[44]  Jan Mendling,et al.  Seven process modeling guidelines (7PMG) , 2010, Inf. Softw. Technol..

[45]  Bashar Nuseibeh,et al.  Requirements engineering: a roadmap , 2000, ICSE '00.

[46]  Remco M. Dijkman,et al.  Meronymy-Based Aggregation of Activities in Business Process Models , 2010, ER.

[47]  Thomas Teufel,et al.  SAP R/3 Process Oriented Implementation: Iterative Process Prototyping , 1998 .

[48]  Paolo Tonella,et al.  Semantically-Aided Business Process Modeling , 2009, International Semantic Web Conference.

[49]  Ingo Weber,et al.  User-Friendly Semantic Annotation in Business Process Modeling , 2007, WISE Workshops.

[50]  Dirk Fahland,et al.  Analysis on demand: Instantaneous soundness checking of industrial business process models , 2011, Data Knowl. Eng..

[51]  Heinrich C. Mayr,et al.  Deriving static and dynamic concepts from software requirements using sophisticated tagging , 2007, Data Knowl. Eng..

[52]  Aditya K. Ghose,et al.  Rapid Business Process Discovery ( R- BPD) , 2007, ER.

[53]  Avigdor Gal,et al.  Machine-Assisted Design of Business Process Models Using Descriptor Space Analysis , 2010, BPM.

[54]  Keith Phalp,et al.  Improving the quality of use case descriptions: empirical assessment of writing guidelines , 2007, Software Quality Journal.

[55]  Branimir Boguraev,et al.  A linguistic analysis engine for natural language use case description and its application to dependability analysis in industrial use cases , 2009, 2009 IEEE/IFIP International Conference on Dependable Systems & Networks.

[56]  W. Marsden I and J , 2012 .

[57]  Patrick McDermott,et al.  Workflow Modeling: Tools for Process Improvement and Application Development , 2001 .

[58]  Remco M. Dijkman,et al.  Similarity of business process models: Metrics and evaluation , 2011, Inf. Syst..

[59]  Aditya K. Ghose,et al.  Process Discovery from Model and Text Artefacts , 2007, 2007 IEEE Congress on Services (Services 2007).

[60]  Terry A. Halpin,et al.  Automated Verbalization for ORM 2 , 2006, OTM Workshops.

[61]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[62]  Natalia Sidorova,et al.  Soundness verification for conceptual workflow nets with data: Early detection of errors with the most precision possible , 2011, Inf. Syst..

[63]  Michael Schrefl,et al.  View integration of behavior in object-oriented databases , 2001, Data Knowl. Eng..

[64]  Jon Espen Ingvaldsen,et al.  A Text Mining Approach to Integrating Business Process Models and Governing Documents , 2005, OTM Workshops.

[65]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[66]  Michael Rosemann,et al.  Potential pitfalls of process modeling: part A , 2006, Bus. Process. Manag. J..

[67]  R. Flesch How to test readability , 1951 .

[68]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[69]  Daniela Grigori,et al.  Ranking BPEL Processes for Service Discovery , 2010, IEEE Transactions on Services Computing.

[70]  Akhil Kumar,et al.  Research Commentary: Workflow Management Issues in e-Business , 2002, Inf. Syst. Res..

[71]  Patrick Delfmann,et al.  Supporting Distributed Conceptual Modelling through Naming Conventions - A Tool-based Linguistic Approach , 2009, Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model..

[72]  Theo P. van der Weide,et al.  Information modeling: The process and the required competencies of its participants , 2004, Data Knowl. Eng..

[73]  Mark Davies The Corpus of Contemporary American English (COCA) , 2012 .

[74]  Peter Jackson,et al.  Natural Language Processing of Online Applications , 2002 .

[75]  Branimir Boguraev,et al.  An Analysis Engine for Dependable Elicitation of Natural Language Use Case Description and Its Application to Industrial Use Cases , 2008 .

[76]  Jan Mendling,et al.  On Labeling Quality in Business Process Models , 2009 .

[77]  Wil M. P. van der Aalst,et al.  An Alternative Way to Analyze Workflow Graphs , 2002, CAiSE.

[78]  Michael Rosemann,et al.  Factors and measures of business process modelling: model building through a multiple case study , 2005, Eur. J. Inf. Syst..

[79]  Peter Jackson,et al.  Natural language processing for online applications : text retrieval, extraction and categorization , 2002 .

[80]  Volker Gruhn,et al.  Detecting Common Errors in Event-Driven Process Chains by Label Analysis , 2011, Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model..

[81]  Hao Ding,et al.  Ontology-based Semantic Annotation for Semantic Interoperability of Process Models , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[82]  Christian Fillies,et al.  Pragmatic applications of the Semantic Web using SemTalk , 2003, Comput. Networks.

[83]  Jan Mendling,et al.  Process Model Generation from Natural Language Text , 2011, CAiSE.

[84]  Christian Kop,et al.  From textual scenarios to a conceptual schema , 2005, Data Knowl. Eng..

[85]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[86]  Jan Mendling,et al.  A Study Into the Factors That Influence the Understandability of Business Process Models , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[87]  Matthias Weidlich,et al.  Automatic Generation of Glossaries for Process Modelling Support , 2011, Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model..