Leveraging path information to generate predictions for parallel business processes

In semi-structured processes, the set of activities that need to be performed, their order and whether additional steps are required are determined by human judgment. There is a growing demand for operational support of such processes during runtime particularly in the form of predictions about the likelihood of future tasks. We address the problem of making predictions for a running instance of a semi-structured process that contains parallel execution paths where the execution path taken by a process instance influences its outcome. In particular, we consider five different models for how to represent an execution trace as a path attribute for training a prediction model. We provide a methodology to determine whether parallel paths are independent, and whether it is worthwhile to model execution paths as independent based on a comparison of the information gain obtained by dependent and independent path representations. We tested our methodology by simulating a marketing campaign as a business process model and selected decision trees as the prediction model. In the evaluation, we compare the complexity and prediction accuracy of a prediction model trained with five different models.

[1]  David S. Touretzky,et al.  Proceedings of the 1993 Connectionist Models Summer School , 2014 .

[2]  Wil M. P. van der Aalst,et al.  Decision Mining in ProM , 2006, Business Process Management.

[3]  Moe Thandar Wynn,et al.  Workflow simulation for operational decision support , 2009, Data Knowl. Eng..

[4]  Boudewijn F. van Dongen,et al.  Supporting Flexible Processes through Recommendations Based on History , 2008, BPM.

[5]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[6]  Abdelwahab Hamou-Lhadj,et al.  Compression techniques to simplify the analysis of large execution traces , 2002, Proceedings 10th International Workshop on Program Comprehension.

[7]  Hongyan Ma,et al.  Process-aware information systems: Bridging people and software through process technology , 2007, J. Assoc. Inf. Sci. Technol..

[8]  Boudewijn F. van Dongen,et al.  Business process mining: An industrial application , 2007, Inf. Syst..

[9]  Jan Mendling,et al.  Business Process Intelligence , 2009, Handbook of Research on Business Process Modeling.

[10]  Chong-Min Kyung,et al.  Improving Lookahead in Parallel Multiprocessor Simulation Using Dynamic Execution Path Prediction , 2006, 20th Workshop on Principles of Advanced and Distributed Simulation (PADS'06).

[11]  Wil M. P. van der Aalst,et al.  Time prediction based on process mining , 2011, Inf. Syst..

[12]  David Haussler,et al.  Quantifying Inductive Bias: AI Learning Algorithms and Valiant's Learning Framework , 1988, Artif. Intell..

[13]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[14]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[15]  Alva L. Couch,et al.  Portable execution traces for parallel program debugging and performance visualization , 1992, Proceedings Scalable High Performance Computing Conference SHPCC-92..

[16]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[17]  Manfred Reichert,et al.  Case-Base Maintenance for CCBR-Based Process Evolution , 2006, ECCBR.

[18]  Frits W. Vaandrager,et al.  Reconstructing Critical Paths from Execution Traces , 2012, 2012 IEEE 15th International Conference on Computational Science and Engineering.

[19]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[20]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[21]  Sriraam Natarajan,et al.  A relational hierarchical model for decision-theoretic assistance , 2011, Knowledge and Information Systems.

[22]  Remco M. Dijkman,et al.  Semantics and analysis of business process models in BPMN , 2008, Inf. Softw. Technol..

[23]  Frank Leymann,et al.  Faster and More Focused Control-Flow Analysis for Business Process Models Through SESE Decomposition , 2007, ICSOC.

[24]  Boudewijn F. van Dongen,et al.  Cycle Time Prediction: When Will This Case Finally Be Finished? , 2008, OTM Conferences.

[25]  Avi Pfeffer Functional Specification of Probabilistic Process Models , 2005, AAAI.

[26]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[27]  Brad Calder,et al.  Dynamic prediction of critical path instructions , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[28]  Hajo A. Reijers,et al.  Product Based Workflow Support: Dynamic Workflow Execution , 2008, CAiSE.

[29]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[30]  Boudewijn F. van Dongen,et al.  Workflow mining: A survey of issues and approaches , 2003, Data Knowl. Eng..

[31]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .