Recognition of Business Process Elements in Natural Language Texts

Process modeling is a complex and important task in any business process management project. Gathering information to build a process model needs effort by analysts in different ways, such as interviews and document review. However, this documentation is not always well structured and can be difficult to be understood. Thus, techniques that allow the structuring and recognition of process elements in the documentation can help in the understanding of the process and, consequently, in the modeling activity. In this context, this paper proposes an approach to recognize business process elements in natural language texts. We defined a set of 32 mapping rules to recognize business process elements in texts using natural language processing techniques and which were identified through an empirical study in texts containing descriptions of a process. Furthermore, a prototype was developed and it showed promising results. The analyses of 70 texts revealed 73.61% precision, 70.15% recall and 71.82% F-measure. Moreover, two surveys showed that 93.33% of participants agree with the mapping rules and that the approach helps the analysts in both the time spent and the effort made in the process modeling task. This paper is a reiteration and an evolution of the work presented in Ferreira et al. [1].

[1]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[2]  Jan Mendling,et al.  Modularity in Process Models: Review and Effects , 2008, BPM.

[3]  Flávia Maria Santoro,et al.  Let Me Tell You a Story - On How to Build Process Models , 2011, J. Univers. Comput. Sci..

[4]  Jan Mendling,et al.  A study on the effects of routing symbol design on process model comprehension , 2013, Decis. Support Syst..

[5]  Jan Mendling,et al.  Understanding the Occurrence of Errors in Process Models Based on Metrics , 2007, OTM Conferences.

[6]  Ted Briscoe,et al.  The Second Release of the RASP System , 2006, ACL.

[7]  Jan Mendling,et al.  What Makes Process Models Understandable? , 2007, BPM.

[8]  James F. Allen Natural language understanding , 1987, Bejnamin/Cummings series in computer science.

[9]  Mark Strembeck,et al.  Factors of process model comprehension - Findings from a series of experiments , 2012, Decis. Support Syst..

[10]  Ekkart Kindler,et al.  On the semantics of EPCs: Resolving the vicious circle , 2006, Data Knowl. Eng..

[11]  Zhu Zhang,et al.  A policy-based process mining framework: mining business policy texts for discovering process models , 2010, Inf. Syst. E Bus. Manag..

[12]  Eric Brill,et al.  A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[13]  Verena Geist,et al.  Event-Based Gateways: Open Questions and Inconsistencies , 2012, BPMN.

[14]  Flávia Maria Santoro,et al.  Business process mining from group stories , 2009, 2009 13th International Conference on Computer Supported Cooperative Work in Design.

[15]  Jan Mendling,et al.  Seven process modeling guidelines (7PMG) , 2010, Inf. Softw. Technol..

[16]  Hajo A. Reijers,et al.  Dealing with Behavioral Ambiguity in Textual Process Descriptions , 2016, BPM.

[17]  Forbes Ad,et al.  Classification-algorithm evaluation: five performance measures based on confusion matrices. , 1995 .

[18]  Stephen A. White,et al.  BPMN modeling and reference guide : understanding and using BPMN : develop rigorous yet understandable graphical representations of business processes , 2008 .

[19]  Hajo A. Reijers,et al.  Detecting Inconsistencies Between Process Models and Textual Descriptions , 2015, BPM.

[20]  Mathias Weske,et al.  Business Process Management: Concepts, Languages, Architectures , 2007 .

[21]  Jan Mendling,et al.  The Impact of Secondary Notation on Process Model Understanding , 2009, PoEM.

[22]  Marcelo Fantinato,et al.  Assisting Process Modeling by Identifying Business Process Elements in Natural Language Texts , 2017, ER Workshops.

[23]  Oskari Heinonen,et al.  Optimal Multi-Paragraph Text Segmentation by Dynamic Programming , 1998, ACL.

[24]  Manfred Reichert,et al.  Activity patterns in process-aware information systems: basic concepts and empirical evidence , 2009, Int. J. Bus. Process. Integr. Manag..

[25]  Jan Mendling,et al.  Managing Structural and Textual Quality of Business Process Models , 2012, SIMPDA.

[26]  Aditya K. Ghose,et al.  Process Discovery from Model and Text Artefacts , 2007, 2007 IEEE Congress on Services (Services 2007).

[27]  Henrik Leopold,et al.  Natural Language in Business Process Models , 2013, Lecture Notes in Business Information Processing.

[28]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[29]  Mohak Shah,et al.  Evaluating Learning Algorithms: A Classification Perspective , 2011 .

[30]  Marcelo Fantinato,et al.  A Semi-automatic Approach to Identify Business Process Elements in Natural Language Texts , 2017, ICEIS.

[31]  Jan Mendling,et al.  Supporting Process Model Validation through Natural Language Generation , 2014, IEEE Transactions on Software Engineering.

[32]  Jan Mendling,et al.  An Approach to Support Process Model Validation based on Text Generation , 2013, EMISA Forum.

[33]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[34]  Jan Mendling,et al.  Process Model Generation from Natural Language Text , 2011, CAiSE.

[35]  Marti A. Hearst Multi-Paragraph Segmentation Expository Text , 1994, ACL.