Innovations in Natural Language Document Processing for Requirements Engineering

This paper evaluates the potential contributions of natural language processing (NLP) to requirements engineering (RE). We present a selective history of the relationship between RE and NLP and briefly summarize relevant recent trends in NLP. The paper outlines basic issues in RE and how they relate to interactions between an NLP front end and system-development processes. We suggest some improvements to NLP that may be possible in the context of RE and conclude with an assessment of what should be done to improve the likelihood of practical impact in this direction.
