A policy-based process mining framework: mining business policy texts for discovering process models

Many organizations use business policies to govern their business processes, often resulting in huge amounts of policy documents. As new regulations arise such as Sarbanes-Oxley, these business policies must be modified to ensure their correctness and consistency. Given the large amounts of business policies, manually analyzing policy documents to discover process information is very time-consuming and imposes excessive workload. In order to provide a solution to this information overload problem, we propose a novel approach named Policy-based Process Mining (PBPM) to automatically extracting process information from policy documents. Several text mining algorithms are applied to business policy texts in order to discover process-related policies and extract such process components as tasks, data items, and resources. Experiments are conducted to validate the extracted components and the results are found to be very promising. To the best of our knowledge, PBPM is the first approach that applies text mining towards discovering business process components from unstructured policy documents. The initial research results presented in this paper will require more research efforts to make PBPM a practical solution.

[1]  Andrew McCallum,et al.  Maximum Entropy Markov Models for Information Extraction and Segmentation , 2000, ICML.

[2]  William J. Kettinger,et al.  Business Process Change: A Study of Methodologies, Techniques, and Tools , 1997, MIS Q..

[3]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[4]  Charles G. Cobb Enterprise Process Mapping: Integrating Systems For Compliance And Business Excellence , 2004 .

[5]  Wil M. P. van der Aalst,et al.  Piet's Razor Applied to BPR: Reengineering Knock-out Processes , 2002 .

[6]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[7]  Robert W. Blanning,et al.  A Formal Approach to Workflow Analysis , 2000, Inf. Syst. Res..

[8]  Boudewijn F. van Dongen,et al.  Business process mining: An industrial application , 2007, Inf. Syst..

[9]  Dmitry Zelenko,et al.  Kernel methods for relation extraction , 2003 .

[10]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[11]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[12]  Jan Mendling,et al.  Business Process Intelligence , 2009, Handbook of Research on Business Process Modeling.

[13]  Akhil Kumar,et al.  Research Commentary: Workflow Management Issues in e-Business , 2002, Inf. Syst. Res..

[14]  Wil M. P. van der Aalst,et al.  Process mining: a research agenda , 2004, Comput. Ind..

[15]  Elaine Marsh,et al.  MUC-7 Evaluation of IE Technology: Overview of Results , 1998, MUC.

[16]  Anindya Datta,et al.  Automating the Discovery of AS-IS Business Process Models: Probabilistic and Algorithmic Approaches , 1998, Inf. Syst. Res..

[17]  Hajo A. Reijers,et al.  Product based workflow design with case handling systems , 2006 .

[18]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[19]  August-Wilhelm Scheer,et al.  ARIS - Business Process Modeling , 1998 .

[20]  V. Daniel Hunt,et al.  Process Mapping: How to Reengineer Your Business Processes , 1996 .

[21]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[22]  Rudolf Vetschera,et al.  Algorithmical approaches to business process design , 2001, Comput. Oper. Res..

[23]  Tariq Aldowaisan,et al.  Business process reengineering: an approach for process mapping , 1999 .

[24]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[25]  Thomas Peltier Information Security: Policies and Procedures: A Practitioner's Reference , 1998 .

[26]  J. Leon Zhao,et al.  Policy-Driven Process Mapping (PDPM): Towards Process Design Automation , 2006, ICIS.

[27]  Michael Collins,et al.  Convolution Kernels for Natural Language , 2001, NIPS.