ProcessTransformer: Predictive Business Process Monitoring with Transformer Network

Predictive business process monitoring focuses on predicting future characteristics of a running process using event logs. The foresight into process execution promises great potentials for efficient operations, better resource management, and effective customer services. Deep learning-based approaches have been widely adopted in process mining to address the limitations of classical algorithms for solving multiple problems, especially the next event and remaining-time prediction tasks. Nevertheless, designing a deep neural architecture that performs competitively across various tasks is challenging as existing methods fail to capture long-range dependencies in the input sequences and perform poorly for lengthy process traces. In this paper, we propose ProcessTransformer, an approach for learning high-level representations from event logs with an attention-based network. Our model incorporates long-range memory and relies on a self-attention mechanism to establish dependencies between a multitude of event sequences and corresponding outputs. We evaluate the applicability of our technique on nine real event logs. We demonstrate that the transformer-based model outperforms several baselines of prior techniques by obtaining on average above 80% accuracy for the task of predicting the next activity. Our method also perform competitively, compared to baselines, for the tasks of predicting event time and remaining time of a running case.

[1]  Annalisa Appice,et al.  Activity Prediction of Business Process Instances with Inception CNN Models , 2019, AI*IA.

[2]  Cheng Guo,et al.  Entity Embeddings of Categorical Variables , 2016, ArXiv.

[3]  Mathias Weske,et al.  Prediction of Remaining Service Execution Time Using Stochastic Petri Nets with Arbitrary Firing Delays , 2013, ICSOC.

[4]  Shiping Chen,et al.  A test-bed for the evaluation of business process prediction techniques , 2011, 7th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom).

[5]  Oscar González Rojas,et al.  Learning Accurate LSTM Models of Business Processes , 2019, BPM.

[6]  Aditya K. Ghose,et al.  Memory-Augmented Neural Networks for Predictive Process Analytics , 2018, ArXiv.

[7]  Francesco Folino,et al.  Context-Aware Predictions on Business Processes: An Ensemble-Based Solution , 2012, NFMCP.

[8]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[9]  Donato Malerba,et al.  Using Convolutional Neural Networks for Predictive Process Analytics , 2019, 2019 International Conference on Process Mining (ICPM).

[10]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[11]  J. Becker,et al.  An empirical comparison of deep-neural-network architectures for next activity prediction using context-enriched process event logs , 2020, ArXiv.

[12]  Lysandre Debut,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[13]  Yu Zhang,et al.  A Survey on Multi-Task Learning , 2017, IEEE Transactions on Knowledge and Data Engineering.

[14]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[15]  Stefanie Rinderle-Ma,et al.  LoGo: Combining Local and Global Techniques for Predictive Business Process Monitoring , 2020, CAiSE.

[16]  Marlon Dumas,et al.  Predictive Business Process Monitoring with LSTM Neural Networks , 2016, CAiSE.

[17]  Jana-Rebecca Rehse,et al.  A Deep Learning Approach for Predicting Process Behaviour at Runtime , 2016, Business Process Management Workshops.

[18]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[19]  Marcello La Rosa,et al.  Encoder-Decoder Generative Adversarial Nets for Suffix Generation and Remaining Time Predication of Business Process Models , 2020, ArXiv.

[20]  Wil M. P. van der Aalst,et al.  Time prediction based on process mining , 2011, Inf. Syst..

[21]  Chengfei Liu,et al.  Outcome-Oriented Predictive Process Monitoring with Attention-Based Bidirectional LSTM Neural Networks , 2019, 2019 IEEE International Conference on Web Services (ICWS).

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Alessandro Sperduti,et al.  Time and activity sequence prediction of business process instances , 2016, Computing.

[24]  Sandro Pezzelle,et al.  The LAMBADA dataset: Word prediction requiring a broad discourse context , 2016, ACL.

[25]  Wil M. P. van der Aalst,et al.  Supporting Risk-Informed Decisions during Business Process Execution , 2013, CAiSE.

[26]  Jianmin Wang,et al.  MM-Pred: A Deep Predictive Model for Multi-attribute Event Sequence , 2019, SDM.

[27]  Alessandro Sperduti,et al.  LSTM networks for data-aware remaining time prediction of business process instances , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[28]  Marlon Dumas,et al.  Outcome-Oriented Predictive Process Monitoring: Review and Benchmark , 2017 .

[29]  Bokyoung Kang,et al.  Periodic Performance Prediction for Real-time Business Process Monitoring , 2012, Ind. Manag. Data Syst..

[30]  Manuel Lama,et al.  Deep Learning for Predictive Business Process Monitoring: Review and Benchmark , 2021, IEEE Transactions on Services Computing.

[31]  Stephen Pauwels,et al.  Bayesian Network Based Predictions of Business Processes , 2020, BPM.

[32]  Marcello La Rosa,et al.  Predictive Business Process Monitoring via Generative Adversarial Nets: The Case of Next Event Prediction , 2020, BPM.

[33]  Martin Matzner,et al.  Time Matters: Time-Aware LSTMs for Predictive Business Process Monitoring , 2020, ArXiv.

[34]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..