Comparing textual descriptions to process models - The automatic detection of inconsistencies

Many organizations maintain textual process descriptions alongside graphical process models. The purpose is to make process information accessible to various stakeholders, including those who are not familiar with reading and interpreting the complex execution logic of process models. Despite this merit, there is a clear risk that model and text become misaligned when changes are not applied to both descriptions consistently. For organizations with hundreds of different processes, the effort required to identify and clear up such conflicts is considerable. To support organizations in keeping their process descriptions consistent, we present an approach to automatically identify inconsistencies between a process model and a corresponding textual description. Our approach detects cases where the two process representations describe activities in different orders and detect process model activities not contained in the textual description. A quantitative evaluation with 53 real-life model-text pairs demonstrates that our approach accurately identifies inconsistencies between model and text. HighlightsWe propose an approach to detect conflicts between textual and model-based process descriptions.The approach is fully automatic based on tailored natural language processing techniques.Quantitative evaluation demonstrates the applicability of the approach on real-life data.

[1]  Jan Mendling,et al.  Predicting the Quality of Process Model Matching , 2013, BPM.

[2]  Imran Sarwar Bajwa,et al.  From Natural Language Software Specifications to UML Class Models , 2011, ICEIS.

[3]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[4]  Jan Mendling,et al.  Increasing Recall of Process Model Matching by Improved Activity Label Matching , 2013, BPM.

[5]  Fausto Giunchiglia,et al.  Semantic Matching , 2018, Encyclopedia of Database Systems.

[6]  Avigdor Gal,et al.  Schema matching prediction with applications to data source discovery and dynamic ensembling , 2013, The VLDB Journal.

[7]  Alon Y. Halevy,et al.  Semantic Integration Research in the Database Community : A Brief Survey , 2005 .

[8]  Owen Rambow,et al.  The ModelExplainer , 1996, INLG.

[9]  Mirjam Minor,et al.  Extracting and enriching workflows from text , 2013, 2013 IEEE 14th International Conference on Information Reuse & Integration (IRI).

[10]  Sophia Ananiadou,et al.  Generating Natural Language specifications from UML class diagrams , 2008, Requirements Engineering.

[11]  Boualem Benatallah,et al.  Enabling the Analysis of Cross-Cutting Aspects in Ad-Hoc Processes , 2013, CAiSE.

[12]  Wei-Ying Ma,et al.  Instance-based Schema Matching for Web Databases by Domain-specific Query Probing , 2004, VLDB.

[13]  Alexander H. Waibel,et al.  Decoding Algorithm in Statistical Machine Translation , 1997, ACL.

[14]  Thomas Allweyer,et al.  BPMN 2.0 : introduction to the standard for business process modeling , 2016 .

[15]  Hajo A. Reijers,et al.  On the Fragmentation of Process Information: Challenges, Solutions, and Outlook , 2015, BMMDS/EMMSAD.

[16]  Jan Mendling,et al.  Process Model Generation from Natural Language Text , 2011, CAiSE.

[17]  Jan Mendling,et al.  Supporting Process Model Validation through Natural Language Generation , 2014, IEEE Transactions on Software Engineering.

[18]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[19]  Xiaohua Hu,et al.  The Evaluation of Sentence Similarity Measures , 2008, DaWaK.

[20]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[21]  Avigdor Gal,et al.  Uncertain Schema Matching , 2011, Uncertain Schema Matching.

[22]  Remco M. Dijkman,et al.  Business Process Model Merging: An Approach to Business Process Consolidation , 2013, TSEM.

[23]  Alon Y. Halevy,et al.  Semantic Integration , 2005, AI Mag..

[24]  Jérôme Euzenat,et al.  Ontology Matching: State of the Art and Future Challenges , 2013, IEEE Transactions on Knowledge and Data Engineering.

[25]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[26]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[27]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[28]  Remco M. Dijkman,et al.  Similarity of business process models: Metrics and evaluation , 2011, Inf. Syst..

[29]  Fernando Gomez,et al.  A System for the Semiautomatic Generation of E-R Models from Natural Language Specifications , 1999, Data Knowl. Eng..

[30]  Mathias Weske,et al.  Propagating changes between aligned process models , 2012, J. Syst. Softw..

[31]  Mathias Weske,et al.  Behavioral Similarity - A Proper Metric , 2011, BPM.

[32]  Carlo Strapparava,et al.  Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.

[33]  Marc Ehrig,et al.  Measuring Similarity between Semantic Business Process Models , 2007, APCCM.

[34]  Saonee Sarker,et al.  An Exploration into the Process of Requirements Elicitation: A Grounded Approach , 2010, J. Assoc. Inf. Syst..

[35]  Flávia Maria Santoro,et al.  Business process mining from group stories , 2009, 2009 13th International Conference on Computer Supported Cooperative Work in Design.

[36]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[37]  Georg Grossmann,et al.  Formalising Natural Language Specifications Using a Cognitive Linguistics/Configuration Based Approach , 2013, 2013 17th IEEE International Enterprise Distributed Object Computing Conference.

[38]  Remco M. Dijkman,et al.  Report: The Process Model Matching Contest 2013 , 2013, Business Process Management Workshops.

[39]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[40]  Andreas Oberweis,et al.  Triple-S: A Matching Approach for Petri Nets on Syntactic, Semantic and Structural level , 2013, BPM 2013.

[41]  Marlon Dumas,et al.  Clone Detection in Repositories of Business Process Models , 2011, BPM.

[42]  Christopher D. Manning,et al.  The Stanford Typed Dependencies Representation , 2008, CF+CDPE@COLING.

[43]  Amit M. Paradkar,et al.  Use Cases to Process Specifications in Business Process Modeling Notation , 2010, 2010 IEEE International Conference on Web Services.

[44]  Remco M. Dijkman,et al.  The ICoP Framework: Identification of Correspondences between Process Models , 2010, CAiSE.

[45]  Hajo A. Reijers,et al.  Detecting Inconsistencies Between Process Models and Textual Descriptions , 2015, BPM.

[46]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[47]  Remco,et al.  Business Process Model Merging , 2013 .