Visual Filtering Tools and Analysis of Case Groups for Process Discovery

Dealing with average-sized event logs is considered a challenging task in process mining, in order to give value to event log data created by a wide variety of systems. An event log consists of a sequence of events for every case that was handled by the system. Discovery algorithms proposed in the literature work well in specific cases, but they usually fail in generic ones. Furthermore, there is no evidence that those existing strategies can handle logs with a large number of variants. We lack a generic approach to allow experts to explore event log data and decompose information into a series of smaller problems, to identify not only outliers, but also relations between the analyzed cases. In this chapter we propose a visual approach for filtering processes based on a low dimensionality representation of cases, a dissimilarity function based on both case attributes and case paths, and the use of entropy and silhouette to evaluate the uncertainty and quality, respectively, of each subset of cases. For each subset of cases, it is possible to reconstruct and evaluate each process model. Those contributions can be combined in an interactive tool to support process discovery. To demonstrate our tool, we use the event log from BPI Challenge 2017.

[1]  Luis Gustavo Nonato,et al.  Local Affine Multidimensional Projection , 2011, IEEE Transactions on Visualization and Computer Graphics.

[2]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[3]  Daniel A. Keim,et al.  Visual exploration of large data sets , 2001, Commun. ACM.

[4]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[5]  Andreas Buja,et al.  Interactive data visualization using focusing and linking , 1991, Proceeding Visualization '91.

[6]  Wil M. P. van der Aalst,et al.  Divide and Conquer: A Tool Framework for Supporting Decomposed Discovery in Process Mining , 2017, Comput. J..

[7]  Boudewijn F. van Dongen,et al.  The ProM Framework: A New Era in Process Mining Tool Support , 2005, ICATPN.

[8]  Boudewijn F. van Dongen,et al.  On the Role of Fitness, Precision, Generalization and Simplicity in Process Discovery , 2012, OTM Conferences.

[9]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[10]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[11]  Haim Levkowitz,et al.  Projection inspector: Assessment and synthesis of multidimensional projections , 2015, Neurocomputing.

[12]  Ashutosh Tiwari,et al.  A review of business process mining: state-of-the-art and future trends , 2008, Bus. Process. Manag. J..

[13]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[14]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[15]  Weidong Chen,et al.  Node-Pancyclic Properties of Biswapped Networks Based on Cycles in Their Factor Networks , 2017, Comput. J..

[16]  Boudewijn F. van Dongen,et al.  ProM 6: The Process Mining Toolkit , 2010, BPM.

[17]  Moe Thandar Wynn,et al.  Change visualisation: Analysing the resource and timing differences between two event logs , 2017, Inf. Syst..

[18]  Jan Mendling,et al.  Challenges of smart business process management: An introduction to the special issue , 2017, Decis. Support Syst..

[19]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[20]  Christian W. Günther,et al.  Disco: Discover Your Processes , 2012, BPM.

[21]  Simone Diniz Junqueira Barbosa,et al.  Visual Support to Filtering Cases for Process Discovery , 2018, ICEIS.

[22]  Richard A. Becker,et al.  Brushing scatterplots , 1987 .

[23]  Jochen De Weerdt,et al.  Multi-objective Trace Clustering: Finding More Balanced Solutions , 2016, Business Process Management Workshops.

[24]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[25]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[26]  Wil M. P. van der Aalst,et al.  Process mining: a research agenda , 2004, Comput. Ind..