Data-aware remaining time prediction of business process instances

Accurate prediction of the completion time of a business process instance would constitute a valuable tool when managing processes under service level agreement constraints. Such prediction, however, is a very challenging task. A wide variety of factors could influence the trend of a process instance, and hence just using time statistics of historical cases cannot be sufficient to get accurate predictions. Here we propose a new approach where, in order to improve the prediction quality, both the control and the data flow perspectives are jointly used. To achieve this goal, our approach builds a process model which is augmented by time and data information in order to enable remaining time prediction. The remaining time prediction of a running case is calculated combining two factors: (a) the likelihood of all the following activities, given the data collected so far; and (b) the remaining time estimation given by a regression model built upon the data.

[1]  H WittenIan,et al.  The WEKA data mining software , 2009 .

[2]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[3]  Boudewijn F. van Dongen,et al.  Cycle Time Prediction: When Will This Case Finally Be Finished? , 2008, OTM Conferences.

[4]  Yurdaer N. Doganata,et al.  A markov prediction model for data-driven semi-structured business processes , 2013, Knowledge and Information Systems.

[5]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[6]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Hajo A. Reijers Case Prediction in BPM Systems : A Research Challenge , 2007 .

[9]  Frank Leymann,et al.  Runtime Prediction of Service Level Agreement Violations for Composite Services , 2009, ICSOC/ServiceWave Workshops.

[10]  D. Basak,et al.  Support Vector Regression , 2008 .

[11]  Francesco Folino,et al.  Discovering High-Level Performance Models for Ticket Resolution Processes , 2013, OTM Conferences.

[12]  Francesco Folino,et al.  Discovering Context-Aware Models for Predicting Business Process Performances , 2012, OTM Conferences.

[13]  Boudewijn F. van Dongen,et al.  ProM: The Process Mining Toolkit , 2009, BPM.

[14]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[15]  Boudewijn F. van Dongen,et al.  Supporting Flexible Processes through Recommendations Based on History , 2008, BPM.

[16]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[17]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[18]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[19]  Wil M. P. van der Aalst,et al.  Time prediction based on process mining , 2011, Inf. Syst..

[20]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[21]  Boudewijn F. van Dongen,et al.  ProM 6: The Process Mining Toolkit , 2010, BPM.

[22]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.