Extracting static and dynamic model elements from textual specifications in humanities

Information systems are composed of different dimensions of information: methodological, structural, architectural, etc. These dimensions appear connected in textual specifications created at early stages of information systems conception. These multidimensional textual specifications are particularly relevant in cases of information systems conceived to support processes in narrative-based domains, such as humanities or social sciences. Working on these domains, a hard effort of dimension decoupling and a fully understanding of the domain is required. Thus, software analysts put a considerable effort in the identification of the most relevant concepts of the domain---to create the static structure of the system---and, in a separate manner, in the identification of the activities---to understand how this information is produced and managed by users in real practice---. This paper presents how existing studies aim to bridge this modelling gap between static and dynamic aspects in information systems from a comprehensive perspective. It also proposes a pipeline method to deal with this gap, using a semi-automatic analysis of textual sources. The pipeline focused on the identification of activities and static concepts, producing concept maps as static models and activity logs as dynamic models. The final goal of the proposed pipeline is reducing the domain understanding complexity required to software analysts in humanities information system conception and improving the quality of the models in both static and dynamic dimensions.

[1]  Luisa Mich,et al.  NL-OOPS: from natural language to object oriented requirements using the natural language processing system LOLITA , 1996, Natural Language Engineering.

[2]  Leonid Kof,et al.  Scenarios: Identifying Missing Objects and Actions by Means of Computational Linguistics , 2007, 15th IEEE International Requirements Engineering Conference (RE 2007).

[3]  Lars Rosenhainer Identifying Crosscutting Concerns in Requirements Specifications , 2004 .

[4]  Natalia Juristo Juzgado,et al.  A formal approach for generating oo specifications from natural language , 1999, J. Syst. Softw..

[5]  Brian Henderson-Sellers,et al.  A Metamodel for Assessable Software Development Methodologies , 2005, Software Quality Journal.

[6]  Denis Silva da Silveira,et al.  A Framework to Diminish the Gap between the Business Specialist and the Software Designer , 2012, 2012 Eighth International Conference on the Quality of Information and Communications Technology.

[7]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[8]  Ivar Jacobson,et al.  Unified Modeling Language Reference Manual, The (2nd Edition) , 2004 .

[9]  John P. McCrae,et al.  CILI: the Collaborative Interlingual Index , 2016, GWC.

[10]  Patricia Martín-Rodilla,et al.  Automatic process model discovery from textual methodologies , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[11]  Cesar Gonzalez-Perez Filling the Voids - From Requirements to Deployment with OPEN/Metis , 2010, ICSOFT.

[12]  Colette Rolland,et al.  Conceptual Modeling and Natural Language Analysis , 2013, Seminal Contributions to Information Systems Engineering.

[13]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[14]  Alberto Trombetta,et al.  BPMN: An introduction to the standard , 2012, Comput. Stand. Interfaces.

[15]  João Araújo,et al.  Early aspects: a model for aspect-oriented requirements engineering , 2002, Proceedings IEEE Joint International Conference on Requirements Engineering.

[16]  Holger Herbst,et al.  Business rule oriented conceptual modeling , 2000 .

[17]  Ruzanna Chitchyan,et al.  EA-Miner: a tool for automating aspect-oriented requirements identification , 2005, ASE.

[18]  Jeremy Gibbons,et al.  Formalisations and applications of BPMN , 2011, Sci. Comput. Program..

[19]  Ivar Jacobson,et al.  The unified modeling language reference manual , 2010 .

[20]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[21]  Maria Pia di Buono,et al.  Decision making support systems for the Archaeological domain: A Natural Language Processing proposal , 2013, 2013 Digital Heritage International Congress (DigitalHeritage).

[22]  G. Miller,et al.  A Semantic Network of English Verbs , 1998 .

[23]  Peter Sawyer,et al.  The REVERE Project: Experiments with the Application of Probabilistic NLP to Systems Engineering , 2000, NLDB.