An ontology-based text processing approach for simplifying ambiguity of requirement specifications

In the last few years, several works in the literature of software engineering have addressed the problem of requirement management. A majority problem of software errors is introduced during the requirements phase because much of requirements specification is written in natural language format. As this, it is hard to identify consistencies because of too ambiguous for specification purpose. Therefore, this paper aims to propose a method for simplifying ambiguity of requirement specification documents through two concepts of ontology-based probabilistic text processing: Text classification and Text Filtering. Text classification is used to analyze and classify requirement specification having similar detail into the same class. This contributes to a better understanding of the impact of the requirements and to elaborate them. Meanwhile, text filters are used to leverage synopsis requirements in documents through probabilistic text classification technique. After testing by F-measure, the experimental results return a satisfactory accuracy. These demonstrate that our method may provide more effectiveness for simplifying ambiguity of requirement specifications.

[1]  Hajime Enomoto,et al.  Software development process from natural language specification , 1989, ICSE '89.

[2]  Barry W. Boehm,et al.  Software Engineering Economics , 1993, IEEE Transactions on Software Engineering.

[3]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[4]  Anthony Hall,et al.  Seven myths of formal methods , 1990, IEEE Software.

[5]  Olga Ormandjieva,et al.  Toward a text classification system for the quality assessment of software requirements written in natural language , 2007, SOQUA '07.

[6]  Gerhard Knolmayer,et al.  Document Classification Methods for Organizing Explicit Knowledge , 2002 .

[7]  Kevin Ryan,et al.  The role of natural language in requirements engineering , 1993, [1993] Proceedings of the IEEE International Symposium on Requirements Engineering.

[8]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[9]  Thomas E. Bell,et al.  Software requirements: Are they really a problem? , 1976, ICSE '76.

[10]  Nicola Guarino,et al.  Formal Ontology and Information Systems , 1998 .

[11]  Surapant Meknavin,et al.  Feature-based Thai Word Segmentation , 1997 .

[12]  Sooyong Park,et al.  Using classification techniques for informal requirements in the requirements analysis-supporting system , 2007, Inf. Softw. Technol..

[13]  Jonathan P. Bowen,et al.  Seven More Myths of Formal Methods , 1994, FME.

[14]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[15]  Jonathan P. Bowen,et al.  Introduction to Formal Methods , 2006 .

[16]  Michael Kifer,et al.  Logical foundations of object-oriented and frame-based languages , 1995, JACM.

[17]  Jantima Polpinij,et al.  An Ontology-Based Sentiment Classification Methodology for Online Consumer Reviews , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[18]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[19]  Wolfgang Mueller,et al.  An Introduction to Formal Methods , 2004 .

[20]  Jeannette M. Wing A specifier's introduction to formal methods , 1990, Computer.

[21]  Jantima Polpinij,et al.  Synopsis Information Extraction in Documents Through Probabilistic Text Classifiers , 2007, ICADL.

[22]  Chris Welty,et al.  FOIS introduction: Ontology---towards a new synthesis , 2001, FOIS.