From Informal Process Diagrams to Formal Process Models

Process modeling is an important activity in business transformation projects. Free-form diagramming tools, such as PowerPoint and Visio, are the preferred tools for creating process models. However, the designs created with such tools are informal sketches that are not amenable to automated analysis. Formal models, although desirable, are rarely created during early design because of the usability problems associated with formal-modeling tools. In this paper, we present an approach for automatically inferring formal process models from informal business process diagrams, so that the strengths of both types of tools can be leveraged. We discuss the sources of structural and semantic ambiguity that are commonly present in informal diagrams and that pose challenges for automated inference. Our approach consists of two phases. First, it performs structural inference to identify the set of nodes and edges that constitute a process model. Second, it performs semantic interpretation, using a classifier that mimics human reasoning to associate modeling semantics with the nodes and edges. We discuss both supervised and unsupervised techniques for training such a classifier. Finally, we report the results of empirical studies, conducted on flow diagrams from real projects, that illustrate the effectiveness of our approach.
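The two phases can be illustrated with a minimal sketch. It is not the paper's implementation: the names `Shape`, `Connector`, `infer_structure`, and `interpret` are hypothetical, the proximity tolerance is an assumed parameter, and a hand-written rule stands in for the trained classifier that the abstract describes.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Shape:
    name: str
    kind: str  # drawing primitive, e.g. "rectangle" or "diamond" (assumed labels)
    x: float   # left edge of the bounding box
    y: float   # top edge of the bounding box
    w: float
    h: float


@dataclass(frozen=True)
class Connector:
    x1: float  # tail endpoint
    y1: float
    x2: float  # head endpoint
    y2: float


def distance_to_shape(px: float, py: float, s: Shape) -> float:
    """Distance from a point to a shape's bounding box (0 if inside)."""
    dx = max(s.x - px, 0.0, px - (s.x + s.w))
    dy = max(s.y - py, 0.0, py - (s.y + s.h))
    return hypot(dx, dy)


def nearest_shape(px: float, py: float, shapes: List[Shape],
                  tolerance: float) -> Optional[Shape]:
    """Return the closest shape within the tolerance, or None if dangling."""
    best = min(shapes, key=lambda s: distance_to_shape(px, py, s), default=None)
    if best is None or distance_to_shape(px, py, best) > tolerance:
        return None
    return best


def infer_structure(shapes: List[Shape], connectors: List[Connector],
                    tolerance: float = 10.0) -> List[Tuple[str, str]]:
    """Phase 1 (sketch): turn loosely drawn connectors into directed edges."""
    edges = []
    for c in connectors:
        src = nearest_shape(c.x1, c.y1, shapes, tolerance)
        dst = nearest_shape(c.x2, c.y2, shapes, tolerance)
        # Connectors that dangle or loop back to the same shape are dropped.
        if src is not None and dst is not None and src != dst:
            edges.append((src.name, dst.name))
    return edges


def interpret(shape: Shape, edges: List[Tuple[str, str]]) -> str:
    """Phase 2 (sketch): a fixed rule standing in for a trained classifier."""
    out_degree = sum(1 for source, _ in edges if source == shape.name)
    if shape.kind == "diamond" or out_degree > 1:
        return "gateway"
    return "activity"
```

In this sketch, a connector whose endpoints stop a few units short of a shape is still attached to it, which mirrors the kind of structural ambiguity that free-form diagrams introduce. A learned classifier would replace the rule in `interpret` with features such as shape kind, degree, and label text.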
