Using natural language to improve the generation of model transformation in software design

Among the present crucial issues in UML Modeling, one of the most common is about the fusion of similar models coming from various sources. Several similar models are created in Software Engineering and it is of primary interest to compare them and, when possible, to craft a general model including a specific one, or just identify models that are in fact equivalent. Most present approaches are based on model structure comparison and alignment on strings for attributes and classe names. This contribution evaluates the added value of several combined NLP techniques based on lexical networks, POS tagging, and Dependency Rules application, and how they might improve the fusion of models. Topics: use of NLP techniques in practical applications.

[1]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[2]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[3]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[4]  Dong Liu,et al.  Natural language requirements analysis and class model generation using UCDA , 2004 .

[5]  Olga Ormandjieva,et al.  Automatic Transition of Natural Language Software Requirements Specification into Formal Presentation , 2005, NLDB.

[6]  Paolo Tonella,et al.  Nomen est omen: analyzing the language of function identifiers , 1999, Sixth Working Conference on Reverse Engineering (Cat. No.PR00303).

[7]  Stephan Roser,et al.  An Approach to Automatically Generated Model Transformation Using Ontology Engineering Space , 2006 .

[8]  Ricardo Baeza-Yates,et al.  Flexible comparison of conceptual graphs , 2001 .

[9]  Alexander F. Gelbukh,et al.  Flexible Comparison of Conceptual GraphsWork done under partial support of CONACyT, CGEPI-IPN, and SNI, Mexico , 2001, DEXA.

[10]  Carol Friedman,et al.  Using contextual and lexical features to restructure and validate the classification of biomedical concepts , 2007, BMC Bioinformatics.

[11]  RahmErhard,et al.  A survey of approaches to automatic schema matching , 2001, VLDB 2001.

[12]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[13]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[14]  David W. Binkley,et al.  What’s in a Name? A Study of Identifiers , 2006, 14th IEEE International Conference on Program Comprehension (ICPC'06).

[15]  Zohra Bellahsene,et al.  An Indexing Structure for Automatic Schema Matching , 2007, ICDE Workshops.

[16]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.