Supporting Collaborative Modeling via Natural Language Processing

Engineering large-scale systems requires the collaboration among experts who use different modeling languages and create multiple models. Due to their independent creation and evolution, these models may exhibit discrepancies in terms of the domain concepts they represent. To help re-align the models without an explicit synchronization, we propose a technique that provides the modelers with suggested concepts that they may be interested in adding to their own models. The approach is modeling-language agnostic since it processes only the text in the models, such as the labels of elements and relationships. In this paper, we focus on determining the similarity of compound nouns, which are frequently used in conceptual models. We propose two algorithms, that make use of word embeddings and domain models, respectively. We report an early validation that assesses the effectiveness of our similarity algorithms against state-of-the-art machine learning algorithms with respect to human judgment.

[1]  Avigdor Gal,et al.  Uncertain Schema Matching , 2011, Uncertain Schema Matching.

[2]  Fabiano Dalpiaz,et al.  Towards Aligning Multi-concern Models via NLP , 2017, 2017 IEEE 25th International Requirements Engineering Conference Workshops (REW).

[3]  Judith N. Levi,et al.  The syntax and semantics of complex nominals , 1978 .

[4]  João Araújo,et al.  MATA: A Unified Approach for Composing UML Aspect Models Based on Graph Transformation , 2009, LNCS Trans. Aspect Oriented Softw. Dev..

[5]  Stefan Kögel,et al.  Automatic Change Recommendation of Models and Meta Models Based on Change Histories , 2016, ME@MoDELS.

[6]  Peter D. Turney Similarity of Semantic Relations , 2006, CL.

[7]  Jérôme Euzenat,et al.  Ontology Matching: State of the Art and Future Challenges , 2013, IEEE Transactions on Knowledge and Data Engineering.

[8]  Matthias Jarke,et al.  Near real-time collaborative modeling for view-based Web information systems engineering , 2017, Inf. Syst..

[9]  R. France,et al.  Aspect-oriented approach to early design modelling , 2004, IEE Proc. Softw..

[10]  Patrick Delfmann,et al.  Unified Enterprise Knowledge Representation with Conceptual Models - Capturing Corporate Language in Naming Conventions , 2009, ICIS.

[11]  Eneko Agirre,et al.  SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation , 2017, *SEMEVAL.

[12]  Dimitris Karagiannis,et al.  On the Conceptualisation of Modelling Methods Using the ADOxx Meta Modelling Platform , 2013 .

[13]  Philippe Desfray,et al.  Viewpoint-Based Modeling-Towards Defining the Viewpoint Concept and Implications for Supporting Modeling Tools , 2012, EMISA.

[14]  Bashar Nuseibeh,et al.  ViewPoints: meaningful relationships are difficult! , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[15]  M. de Rijke,et al.  Short Text Similarity with Word Embeddings , 2015, CIKM.

[16]  James Richard Curran,et al.  From distributional to semantic similarity , 2004 .

[17]  Hajo A. Reijers,et al.  Using Hidden Markov Models for the accurate linguistic analysis of process model activity labels , 2019, Inf. Syst..

[18]  Josep Carmona,et al.  Aligning Textual and Graphical Descriptions of Processes Through ILP Techniques , 2017, CAiSE.

[19]  Hajo A. Reijers,et al.  Comparing textual descriptions to process models - The automatic detection of inconsistencies , 2017, Inf. Syst..

[20]  Elisa Baniassad,et al.  Aspect-Oriented Analysis and Design , 2005 .

[21]  Sylvie Ranwez,et al.  Semantic Similarity from Natural Language and Ontology Analysis , 2015, Synthesis Lectures on Human Language Technologies.

[22]  Maria Lapata,et al.  The Disambiguation of Nominalizations , 2002, CL.

[23]  Diarmuid Ó Séaghdha,et al.  Using Lexical and Relational Similarity to Classify Semantic Relations , 2009, EACL.

[24]  Daniela Grigori,et al.  Process Matching Techniques , 2016 .

[25]  John Liu,et al.  sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings , 2015, ArXiv.

[26]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[27]  Douglas C. Schmidt,et al.  Ultra-Large-Scale Systems: The Software Challenge of the Future , 2006 .

[28]  Radu Calinescu,et al.  Large-scale complex IT systems , 2011, Commun. ACM.

[29]  Branko Perisic,et al.  Sirius: A rapid development of DSM graphical editor , 2014, IEEE 18th International Conference on Intelligent Engineering Systems INES 2014.

[30]  Dániel Varró,et al.  Property-Based Locking in Collaborative Modeling , 2017, 2017 ACM/IEEE 20th International Conference on Model Driven Engineering Languages and Systems (MODELS).

[31]  Sjaak Brinkkemper,et al.  Extracting conceptual models from user stories with Visual Narrator , 2017, Requirements Engineering.

[32]  Birgit Grammel,et al.  Model matching for trace link generation in model-driven software development , 2012, MODELS'12.