Using classification techniques for informal requirements in the requirements analysis-supporting system

In order to efficiently develop large-scale and complicated software, it is important for system engineers to correctly understand users' requirements. Most requirements in large-scale projects are collected from various stakeholders located in various regions, and they are generally written in natural language. Therefore, the initial collected requirements must be classified into various topics prior to analysis phases in order to be usable as input in several requirements analysis methods. If this classification process is manually done by analysts, it becomes a time-consuming task. To solve this problem, we propose a new bootstrapping method which can automatically classify requirements sentences into each topic category using only topic words as the representative of the analysts' views. The proposed method is verified through experiments using two requirements data sets: one written in English and the other in Korean. The significant performances were achieved in the experiments: the 84.28 and 87.91 F1 scores for the English and Korean data sets, respectively. As a result, the proposed method can provide an effective function for an Internet-based requirements analysis-supporting system so as to efficiently gather and analyze requirements from various and distributed stakeholders by using the Internet.

[1]  Jay F. Nunamaker,et al.  Automatic concept classification of text from electronic meetings , 1994, CACM.

[2]  Björn Regnell,et al.  A Feasibility Study of Automated Natural Language Requirements Analysis in Market-Driven Development , 2002, Requirements Engineering.

[3]  Sooyong Park,et al.  Web-based Requirements Elicitation Supporting System using Requirements Sentences Categorization , 2000 .

[4]  Jacob L. Cybulski,et al.  Computer-assisted analysis and refinement of informal software requirements documents , 1998, Proceedings 1998 Asia Pacific Software Engineering Conference (Cat. No.98EX240).

[5]  Alan M. Davis More Words Of Wisdom , 1998, IEEE Software.

[6]  Daniela E. Damian,et al.  The impact of stakeholders' geographical distribution on managing requirements in a multi-site organization , 2002, Proceedings IEEE Joint International Conference on Requirements Engineering.

[7]  Stefania Gnesi,et al.  Applications of linguistic techniques for use case analysis , 2003, Requirements Engineering.

[8]  Claire Cardie,et al.  An Analysis of Statistical and Syntactic Phrases , 1997, RIAO.

[9]  Joseph A. Goguen,et al.  Techniques for requirements elicitation , 1993, [1993] Proceedings of the IEEE International Symposium on Requirements Engineering.

[10]  BrillEric,et al.  Transformation-based error-driven learning and natural language processing , 1995 .

[11]  Shimon Edelman,et al.  Similarity-based Word Sense Disambiguation , 1998, CL.

[12]  Björn Regnell,et al.  A linguistic-engineering approach to large-scale requirements management , 2005, IEEE Software.

[13]  Jinwoo Park,et al.  Improving text categorization using the importance of sentences , 2004, Inf. Process. Manag..

[14]  Ian Sommerville,et al.  Viewpoints: principles, problems and a practical approach to requirements engineering , 1997, Ann. Softw. Eng..

[15]  Youngjoong Ko,et al.  Using the feature projection technique based on a normalized voting method for text classification , 2004, Inf. Process. Manag..

[16]  Sooyong Park,et al.  Implementation of an efficient requirements-analysis supporting system using similarity measure techniques , 2000, Inf. Softw. Technol..

[17]  Alan M. Davis Predictions and Farewells , 1998, IEEE Softw..

[18]  Mark Klein,et al.  Attribute-Based Architectural Styles , 1999 .

[19]  Youngjoong Ko,et al.  Learning with Unlabeled Data for Text Categorization Using a Bootstrapping and a Feature Projection Technique , 2004, ACL.

[20]  Luqi,et al.  Gathering requirements from remote users , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[21]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.

[22]  Tom M. Mitchell,et al.  Using unlabeled data to improve text classification , 2001 .

[23]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[24]  Mark T True,et al.  Software Requirements , 2005 .

[25]  Brian Berenbach,et al.  Impact of organizational structure on distributed requirements engineering processes: lessons learned , 2006, GSD '06.

[26]  Yiqing Liang,et al.  Indexing and clustering of software requirements specifications , 1992 .

[27]  Ian Sommerville,et al.  Integrated requirements engineering: a tutorial , 2005, IEEE Software.

[28]  Sooyong Park,et al.  Web-based Requirements Elicitation Supporting System using Requirements Categorization , 2001 .

[29]  Tom M. Mitchell,et al.  Learning to construct knowledge bases from the World Wide Web , 2000, Artif. Intell..

[30]  Leonard J. Bass,et al.  Attribute-Based Architecture Styles , 1999, WICSA.

[31]  Kevin Ryan,et al.  The role of natural language in requirements engineering , 1993, [1993] Proceedings of the IEEE International Symposium on Requirements Engineering.

[32]  Gregory D. Schumacher,et al.  IEEE Guide for Developing System Requirements Specifications , 1999 .

[33]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[34]  Barry W. Boehm,et al.  Requirements engineering, expectations management, and the Two Cultures , 1999, Proceedings IEEE International Symposium on Requirements Engineering (Cat. No.PR00188).

[35]  Sooyong Park,et al.  Automated support to system modeling from informal software requirements , 1994, SEKE.

[36]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[37]  Dean Leffingwell,et al.  Managing software requirements: a unified approach , 1999 .