A Markov prediction model for data-driven semi-structured business processes

In semi-structured case-oriented business processes, the sequence of process steps is determined by case workers based on the document content associated with a case. Transitions between process execution steps are therefore case-specific and depend on the independent judgment of case workers. In this paper, we propose an instance-specific probabilistic process model (PPM) whose transition probabilities are customized to the semi-structured business process instance it represents. An instance-specific PPM serves as a representation for predicting the likelihood of different outcomes. We also show that certain instance-specific PPMs can be transformed into a Markov chain under some non-restrictive assumptions. For instance-specific PPMs that contain parallel execution of tasks, we provide an algorithm to map them to an extended-space Markov chain. This way, existing Markov techniques can be leveraged to make predictions about the likelihood of executing future tasks. Predictions provided by our technique could generate early alerts for case workers about the likelihood of important or undesired outcomes in an executing case instance. We have implemented and validated our approach on a simulated semi-structured business process for handling automobile insurance claims. Results indicate that an instance-specific PPM provides more accurate predictions than other methods, such as conditional probability. We also show that the prediction accuracy of an instance-specific PPM increases as more document data become available.
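To illustrate how a Markov chain can yield the likelihood of executing a future task, the following is a minimal sketch, not the paper's implementation. It assumes the transition probabilities are already known and fixed, whereas in an instance-specific PPM they would be derived from the document content of the running case. The task names, the probabilities, and the function `reach_probability` are all hypothetical.

```python
def reach_probability(transitions, start, target, sweeps=200):
    """Estimate the probability that a chain started in `start` ever visits `target`.

    `transitions` maps each task to a dict {next_task: probability}.
    Tasks without an entry are treated as absorbing (final) outcomes.
    """
    # Collect every task that appears as a source or a destination.
    states = set(transitions)
    for row in transitions.values():
        states.update(row)

    # Start from 0 everywhere except the target, then repeatedly apply
    # prob[s] = sum over next tasks t of P(s -> t) * prob[t].
    prob = {s: 0.0 for s in states}
    prob[target] = 1.0
    for _ in range(sweeps):
        for s in states:
            if s == target:
                continue
            row = transitions.get(s, {})
            prob[s] = sum(p * prob[t] for t, p in row.items())
    return prob[start]


# Hypothetical claims-handling chain; all numbers are made up for illustration.
CLAIM_CHAIN = {
    "receive_claim": {"assess_damage": 0.7, "request_documents": 0.3},
    "request_documents": {"assess_damage": 0.8, "reject": 0.2},
    "assess_damage": {"approve": 0.6, "investigate_fraud": 0.4},
    "investigate_fraud": {"approve": 0.5, "reject": 0.5},
}

if __name__ == "__main__":
    p_reject = reach_probability(CLAIM_CHAIN, "receive_claim", "reject")
    print(f"P(reject | receive_claim) = {p_reject:.3f}")
```

A case worker alert could then be triggered when such a probability for an undesired outcome exceeds a chosen threshold; the threshold itself is outside the scope of this sketch.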
