Thresholds for error probability measures of business process models

The quality of conceptual business process models is highly relevant for the design of the corresponding information systems. From a business perspective, precise measurement of model characteristics helps to save costs through early error detection. The same holds from a software engineering point of view, where models facilitate stakeholder communication and software system design. Research has investigated several proposed measures for business process models, mostly from a correlational perspective. This helps in understanding, for example, size and complexity as general driving forces of error probability. Yet design decisions usually have to build on thresholds that reliably indicate when a certain counter-action has to be taken. Providing measures alone cannot achieve this; it requires a systematic identification of effective and meaningful thresholds. In this paper, we derive thresholds for a set of structural measures for predicting errors in conceptual process models. To this end, we use a collection of 2,000 business process models from practice to determine thresholds, applying an adaptation of the ROC curve method. Furthermore, we extensively validate the derived thresholds using 429 EPC models from an Australian financial institution. Finally, we adapt the significant thresholds to refine existing modeling guidelines in a quantitative way.
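To illustrate the general idea of deriving a threshold from a ROC curve, the following is a minimal sketch, not the adapted method used in the paper. It assumes that each model has one structural measure value (here a hypothetical node count) and a known error label, and it picks the cut-off that maximizes Youden's J (sensitivity + specificity - 1), which is one common selection criterion and an assumption on our part. The function name `roc_threshold` and the sample data are invented for illustration.

```python
from typing import List, Tuple


def roc_threshold(values: List[float], has_error: List[bool]) -> Tuple[float, float, float]:
    """Return (threshold, sensitivity, specificity) maximizing Youden's J.

    A model is flagged as error-prone when its measure value >= threshold.
    Assumption: higher measure values indicate higher error probability.
    """
    positives = sum(has_error)
    negatives = len(has_error) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one model with and one without errors")

    best = None
    # Every distinct observed value is a candidate cut-off (one ROC point each).
    for t in sorted(set(values)):
        true_pos = sum(1 for v, e in zip(values, has_error) if e and v >= t)
        true_neg = sum(1 for v, e in zip(values, has_error) if not e and v < t)
        sensitivity = true_pos / positives
        specificity = true_neg / negatives
        j = sensitivity + specificity - 1
        if best is None or j > best[0]:
            best = (j, t, sensitivity, specificity)
    return best[1], best[2], best[3]


# Hypothetical data: node counts of eight models and whether each contains an error.
node_counts = [10, 15, 20, 25, 30, 40, 50, 60]
errors = [False, False, False, False, True, False, True, True]

threshold, sens, spec = roc_threshold(node_counts, errors)
print(f"flag models with >= {threshold} nodes "
      f"(sensitivity={sens:.2f}, specificity={spec:.2f})")
```

In practice, a threshold found this way would only be worth reporting if the measure discriminates well overall (for example, a sufficiently large area under the ROC curve) and if it holds up on an independent model collection, as the validation step in the abstract suggests.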
