Variational Autoencoder for Anomaly Detection in Event Data in Online Process Mining

The analysis of event data recorded by information systems is becoming increasingly relevant. An increasing data-centric analysis of processes by using process mining techniques has a direct impact on the management of business processes. To achieve a positive impact on business process management, a high quality data basis is important. This paper presents an approach for the application of variational autoencoder for the filtering of anomalous event data in an online process mining environment, which help to improve the results of process mining techniques and thus positively influence business process management. For anomaly detection in an unsupervised environment, mass-volume and excess-mass scores are used as metrics. The results are compared on the basis of established algorithms such as one-class support vector machine, isolation forest and local outlier factor. These insights are used to highlight the benefits of this approach for process mining and business process management.

[1]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[2]  Bernhard Schölkopf,et al.  Support Vector Method for Novelty Detection , 1999, NIPS.

[3]  Max Mühlhäuser,et al.  Analyzing business process anomalies using autoencoders , 2018, Machine Learning.

[4]  Arthur H. M. ter Hofstede,et al.  Filtering Out Infrequent Behavior from Business Process Event Logs , 2017, IEEE Transactions on Knowledge and Data Engineering.

[5]  Jianmin Wang,et al.  Cleaning structured event logs: A graph repair approach , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[6]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[7]  Han-Lim Choi,et al.  Deep Generative Models-Based Anomaly Detection for Spacecraft Control Systems , 2020, Sensors.

[8]  Nathan Marz,et al.  Big Data: Principles and best practices of scalable realtime data systems , 2015 .

[9]  Diederik P. Kingma,et al.  An Introduction to Variational Autoencoders , 2019, Found. Trends Mach. Learn..

[10]  Marcello La Rosa,et al.  Filtering Spurious Events from Event Streams of Business Processes , 2018, CAiSE.

[11]  John Cristian Borges Gamboa,et al.  Deep Learning for Time-Series Analysis , 2017, ArXiv.

[12]  Bogdan Franczyk,et al.  Lambda Architecture for Anomaly Detection in Online Process Mining Using Autoencoders , 2020, ICCCI.

[13]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Nicolas Goix,et al.  How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms? , 2016, ArXiv.

[16]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[17]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[18]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[19]  Sander J. J. Leemans,et al.  Discovering Block-Structured Process Models from Event Logs - A Constructive Approach , 2013, Petri Nets.

[20]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[21]  Carl Doersch,et al.  Tutorial on Variational Autoencoders , 2016, ArXiv.

[22]  Han-Lim Choi,et al.  Learning-Based Anomaly Detection and Monitoring for Swarm Drone Flights , 2019 .

[23]  Marcello La Rosa,et al.  Predictive Business Process Monitoring via Generative Adversarial Nets: The Case of Next Event Prediction , 2020, BPM.

[24]  Marlon Dumas,et al.  Detecting Sudden and Gradual Drifts in Business Processes from Execution Traces , 2017, IEEE Transactions on Knowledge and Data Engineering.

[25]  Zhi-Hua Zhou,et al.  Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[26]  Martin Wickler,et al.  Applications of Deep Learning Neural Networks to Satellite Telemetry Monitoring , 2018 .

[27]  Boudewijn F. van Dongen,et al.  Detecting Drift from Event Streams of Unpredictable Business Processes , 2016, ER.