Mining post-surgical care processes in breast cancer patients

In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a cohort of 3000 breast cancer patients. The applied method relies on longitudinal data extracted from electronic health records, recorded from the first surgical procedure after a breast cancer diagnosis. Careflows are mined from events data recorded for administrative purposes, including procedures from ICD9 - CM billing codes and chemotherapy treatments. Events data have been pre-processed with Topic Modelling to create composite events based on concurrent procedures. The results of the careflow mining algorithm allow the discovery of electronic temporal phenotypes across the studied population. These phenotypes are further characterized on the basis of clinical traits and tumour histopathology, as well as in terms of relapses, metastasis occurrence and 5-year survival rates. Results are highly significant from a clinical perspective, since phenotypes describe well characterized pathology classes, and the careflows are well matched with existing clinical guidelines. The analysis thus facilitates deriving real-world evidence that can inform clinicians as well as hospital decision makers.

[1]  Serena Mazzucchelli,et al.  Autologous fat transfer after breast cancer surgery: An exact-matching study on the long-term oncological safety. , 2019, European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology.

[2]  Riccardo Bellazzi,et al.  Temporal electronic phenotyping by mining careflows of breast cancer patients , 2017, J. Biomed. Informatics.

[3]  Jianmin Wang,et al.  Incorporating Topic Assignment Constraint and Topic Correlation Limitation into Clinical Goal Discovering for Clinical Pathway Mining , 2017, Journal of healthcare engineering.

[4]  Jacques Bouaud,et al.  Explainable artificial intelligence for breast cancer: A visual case-based reasoning approach , 2019, Artif. Intell. Medicine.

[5]  Wil M. P. van der Aalst Process mining , 2012, CACM.

[6]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[7]  Silke Adam,et al.  Applying LDA Topic Modeling in Communication Research: Toward a Valid and Reliable Methodology , 2018 .

[8]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[9]  Wil M. P. van der Aalst,et al.  Process Mining in Healthcare: Data Challenges When Answering Frequently Posed Questions , 2012, ProHealth/KR4HC.

[10]  Riccardo Bellazzi,et al.  Careflow Mining Techniques to Explore Type 2 Diabetes Evolution , 2018, Journal of diabetes science and technology.

[11]  Geoff Hall,et al.  Process mining routinely collected electronic health records to define real-life clinical pathways during chemotherapy , 2017, Int. J. Medical Informatics.

[12]  Wil M. P. van der Aalst,et al.  Process mining: discovering and improving Spaghetti and Lasagna processes , 2011, CIDM 2011.

[13]  Silvana Quaglini,et al.  Leveraging semantic labels for multi-level abstraction in medical process mining and trace comparison , 2018, J. Biomed. Informatics.

[14]  Huilong Duan,et al.  On mining clinical pathway patterns from medical behaviors , 2012, Artif. Intell. Medicine.

[15]  Milos Hauskrecht,et al.  Nonparametric Regressive Point Processes Based on Conditional Gaussian Processes , 2019, NeurIPS.

[16]  Eduardo Alves Portela Santos,et al.  Applying process mining and semantic reasoning for process model customisation in healthcare , 2019, Enterp. Inf. Syst..

[17]  Xiao Xu,et al.  TCPM: Topic-Based Clinical Pathway Mining , 2016, 2016 IEEE First International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[18]  Uzay Kaymak,et al.  On process mining in health care , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[19]  Artie Basukoski,et al.  Discovering Business Processes in CRM Systems by Leveraging Unstructured Text Data , 2018, 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[20]  Mauro Vallati,et al.  pMineR: An Innovative R Library for Performing Process Mining in Medicine , 2017, AIME.

[21]  Huilong Duan,et al.  Discovery of clinical pathway patterns from event logs using probabilistic topic models , 2014, J. Biomed. Informatics.

[22]  SepúlvedaMarcos,et al.  Process mining in healthcare , 2016 .

[23]  Jorge Munoz-Gama,et al.  Process mining in healthcare: A literature review , 2016, J. Biomed. Informatics.

[24]  David B. Dunson,et al.  Probabilistic topic models , 2012, Commun. ACM.