Mining Disease Courses across Organizations: A Methodology Based on Process Mining of Diagnosis Events Datasets

This work proposes the use of Process Mining methodologies on healthcare datasets containing diagnosis information as a means to identify the course of a disease across organizations. Datasets containing diagnosis information for administrative purposes are a good candidate due to its standardized format, widespread availability and coverage. We present a methodology to preprocess, cluster and mine diagnosis information and the results of a preliminary use case with diabetes type II. Some meaningful disease courses have been found but less useful patterns do also emerge. Future work involves lowering the level of granularity chosen (ICD three digit codes) and extending the time span of the data available (three years).

[1]  Jorge Munoz-Gama,et al.  Process mining in healthcare: A literature review , 2016, J. Biomed. Informatics.

[2]  Diogo R. Ferreira,et al.  Understanding Spaghetti Models with Sequence Clustering for ProM , 2009, Business Process Management Workshops.

[3]  Seppe K. L. M. vanden Broucke,et al.  A guide for the application of analytics on healthcare processes: A dynamic view on patient pathways , 2016, Comput. Biol. Medicine.

[4]  Wil M. P. van der Aalst,et al.  Process Mining in Healthcare: Data Challenges When Answering Frequently Posed Questions , 2012, ProHealth/KR4HC.

[5]  Wil M. P. van der Aalst,et al.  Process Mining , 2016, Springer Berlin Heidelberg.

[6]  Eduardo Alves Portela Santos,et al.  Managing Business Process Variability Through Process Mining and Semantic Reasoning: An Application in Healthcare , 2017, PRO-VE.

[7]  Diogo R. Ferreira,et al.  Business process analysis in healthcare environments: A methodology based on process mining , 2012, Inf. Syst..

[8]  Yuval Shahar,et al.  Exploration of patterns predicting renal damage in patients with diabetes type II using a visual temporal analysis laboratory , 2015, J. Am. Medical Informatics Assoc..

[9]  Luigi Pontieri,et al.  Discovering expressive process models by clustering log traces , 2006, IEEE Transactions on Knowledge and Data Engineering.

[10]  Fei Wang,et al.  Investigating Clinical Care Pathways Correlated with Outcomes , 2013, BPM.

[11]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[12]  Wil M.P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[13]  George Hripcsak,et al.  Exploiting time in electronic health record correlations , 2011, J. Am. Medical Informatics Assoc..

[14]  Wil M. P. van der Aalst,et al.  Application of Process Mining in Healthcare - A Case Study in a Dutch Hospital , 2008, BIOSTEC.

[15]  R. P. Jagadeesh Chandra Bose,et al.  Process mining in the large : preprocessing, discovery, and diagnostics , 2012 .

[16]  K. Ganesha,et al.  Analyzing the waiting time of patients in hospital by applying heuristics process miner , 2017, 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT).

[17]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .