Identifying Cohorts: Recommending Drill-Downs Based on Differences in Behaviour for Process Mining

Process mining aims to obtain insights from event logs to improve business processes. In complex environments with large variances in process behaviour, analysing and making sense of such complex processes becomes challenging. Insights in such processes can be obtained by identifying sub-groups of traces (cohorts) and studying their differences. In this paper, we introduce a new framework that elicits features from trace attributes, measures the stochastic distance between cohorts defined by sets of these features, and presents this landscape of sets of features and their influence on process behaviour to users. Our framework differs from existing work in that it can take many aspects of behaviour into account, including the ordering of activities in traces (control flow),the relative frequency of traces (stochastic perspective), and cost. The framework has been instantiated and implemented, has been evaluated for feasibility on multiple publicly available real-life event logs, and evaluated on real-life case studies in two Australian universities

[1]  Marlon Dumas,et al.  Detecting Sudden and Gradual Drifts in Business Processes from Execution Traces , 2017, IEEE Transactions on Knowledge and Data Engineering.

[2]  Sander J. J. Leemans,et al.  PM ^2 : A Process Mining Project Methodology , 2015, CAiSE.

[3]  Boudewijn F. van Dongen,et al.  Business Process Comparison: A Methodology and Case Study , 2017, BIS.

[4]  Hajo A. Reijers,et al.  Comparing Business Process Variants Using Models and Event Logs , 2014, BMMDS/EMMSAD.

[5]  Marlon Dumas,et al.  Business Process Variant Analysis: Survey and Classification , 2021, Knowl. Based Syst..

[6]  Wil M. P. van der Aalst,et al.  A general process mining framework for correlating, predicting and clustering dynamic behavior based on event logs , 2016, Inf. Syst..

[7]  Marta Indulska,et al.  Automated insightful drill-down recommendations for learning analytics dashboards , 2020, LAK.

[8]  Wil M. P. van der Aalst,et al.  Finding Process Variants in Event Logs - (Short Paper) , 2017, OTM Conferences.

[9]  Wil M. P. van der Aalst,et al.  Comparative Process Mining in Education: An Approach Based on Process Cubes , 2013, SIMPDA.

[10]  Moe T. Wynn,et al.  Directly Follows-Based Process Mining: Exploration & a Case Study , 2019, 2019 International Conference on Process Mining (ICPM).

[11]  Wil M. P. van der Aalst,et al.  A Visual Approach to Spot Statistically-Significant Differences in Event Logs Based on Process Metrics , 2016, CAiSE.

[12]  Wil M. P. van der Aalst,et al.  Exploiting Process Cubes, Analytic Workflows and Process Mining for Business Process Reporting: A Case Study in Education , 2015, SIMPDA.

[13]  Marta Indulska,et al.  Recommending Insightful Drill-Downs Based on Learning Processes for Learning Analytics Dashboards , 2020, AIED.

[14]  Moe Thandar Wynn,et al.  Measuring Patient Flow Variations: A Cross-Organisational Process Mining Approach , 2014, AP-BPM.

[15]  Aditya G. Parameswaran,et al.  Interactive Data Exploration with Smart Drill-Down , 2019, IEEE Transactions on Knowledge and Data Engineering.

[16]  Max Mühlhäuser,et al.  ProcessExplorer: Intelligent Process Mining Guidance , 2019, BPM.

[17]  Wil M. P. van der Aalst,et al.  Process variant comparison: Using event logs to detect differences in behavior and business rules , 2017, Inf. Syst..

[18]  Sander J. J. Leemans,et al.  Earth Movers' Stochastic Conformance Checking , 2019, BPM Forum.

[19]  Aditya G. Parameswaran,et al.  Avoiding drill-down fallacies with VisPilot: assisted exploration of data subsets , 2019, IUI.

[20]  Bart Baesens,et al.  Active Trace Clustering for Improved Process Discovery , 2013, IEEE Transactions on Knowledge and Data Engineering.

[21]  Wil M. P. van der Aalst,et al.  Context Aware Trace Clustering: Towards Improving Process Mining Results , 2009, SDM.

[22]  K. Dearden,et al.  The power of positive deviance , 2004, BMJ : British Medical Journal.