Determination of Natural Language Processing Tasks and Tools for Topological Functioning Modelling

Topological Functioning Modelling (TFM) is based on analysis of exhaustive verbal descriptions of the domain functionality. Manual acquisition of knowledge about the domain from text in natural language requires a lot of resources. Natural Language Processing (NLP) tools provide automatic analysis of text in natural language and may fasten and make cheaper this process. First, the knowledge, its expressing elements of the English language, and processing tasks that are required for construction of the topological functioning model are identified. The overview of the support of these tasks by the main NLP pipelines is based on the available documentation without performing practical experiments. The results showed that among the selected six NLP pipelines the largest support comes from the Stanford CoreNLP toolkit, FreeLing, and NLTK toolkit. They allow analysing not only the words and sentences, but also dependencies in word groups and between sentences. The obtained results can be used for academics and practitioners that perform research on NLP for composition of domain (business, system, software) models.

[1]  Jan Mendling,et al.  Supporting Process Model Validation through Natural Language Generation , 2014, IEEE Transactions on Software Engineering.

[2]  Armands Slihte,et al.  Transforming Textual use Cases to a Computation Independent Model , 2010, MDA/MTDD.

[3]  Hugo Gonçalo Oliveira,et al.  Comparing the Performance of Different NLP Toolkits in Formal and Social Media Text , 2016, SLATE.

[4]  Mario Cannataro,et al.  Knowledge management and XML: derivation of synthetic views over semi-structured data , 2002, SIAP.

[5]  S. Abirami,et al.  Conceptual modeling of natural language functional requirements , 2014, J. Syst. Softw..

[6]  Erika Asnina,et al.  Specification of decision-making and control flow branching in Topological Functioning Models of systems , 2015, 2015 International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE).

[7]  Yuichi Ohta,et al.  Knowledge extraction from diagram and text for media integration , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[8]  Julio C. Facelli,et al.  Automatic Extraction of Nanoparticle Properties Using Natural Language Processing: NanoSifter an Application to Acquire PAMAM Dendrimer Properties , 2014, PloS one.

[9]  Olga Ormandjieva,et al.  Models Derived from Automatically Analyzed Textual User Requirements , 2006, Fourth International Conference on Software Engineering Research, Management and Applications (SERA'06).

[10]  Mário Rodrigues,et al.  Advanced Applications of Natural Language Processing for Performing Information Extraction , 2015 .

[11]  Armands Slihte,et al.  The Integrated Domain Modeling: A Case Study , 2014 .

[12]  Paul Vickers,et al.  Parsed use case descriptions as a basis for object-oriented class model generation , 2011, J. Syst. Softw..