Automatic Extraction of Structural Model from Semi Structured Software Requirement Specification

Software requirement specifications are usually documented in an unstructured, semi-structured, or structured format. Requirements in the unstructured format are written as plain continuous paragraphs, whereas the structured format specifies requirements by means of diagrams. The semi-structured format represents requirements with the help of keywords. The literature suggests that rule-based approaches have been the common choice for unstructured documents. However, these rule-based approaches do not yield satisfactory results for the semi-structured format. Consequently, the rules need to be reframed before they can be applied to semi-structured documents. In this paper, we present an improvement on the existing rules that takes into account the keywords present in the text. The technique automatically extracts class diagrams using NLP tools and techniques. The newly formulated rules, along with the existing ones, have been tested on different case studies, and suitable metrics have been devised to evaluate the results. The results show that the automatically generated class diagrams have an average precision of 82% and an average recall of 94% when compared with the class diagrams produced by human experts.
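
To illustrate the comparison against expert-produced diagrams, the following is a minimal sketch of set-based precision and recall over diagram elements. It assumes that elements (here, class names) are matched by exact name; the metrics devised in the paper may be defined differently. The function name `precision_recall` and the example class names are hypothetical and are not taken from the case studies.

```python
def precision_recall(generated, reference):
    """Return (precision, recall) of generated elements against a reference set.

    Assumption: elements are compared by exact string match.
    """
    generated, reference = set(generated), set(reference)
    true_positives = len(generated & reference)
    # Precision: fraction of generated elements that the expert also identified.
    precision = true_positives / len(generated) if generated else 0.0
    # Recall: fraction of expert-identified elements that were generated.
    recall = true_positives / len(reference) if reference else 0.0
    return precision, recall


# Hypothetical class names, for illustration only.
generated_classes = {"Customer", "Order", "Invoice", "Item", "System"}
expert_classes = {"Customer", "Order", "Invoice", "Item"}

precision, recall = precision_recall(generated_classes, expert_classes)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case one spurious class lowers precision while recall stays perfect, the same direction of imbalance as the reported averages (precision lower than recall). Averaging the per-case values over several case studies would give figures comparable in form to those reported.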
