Revealing Cause-Effect Relations in Comorbidities Analysis Using Process Mining and Tensor Network Decomposition

The existence of certain comorbidities, the co-occurrence of different diseases in the same individual, is well-known in the medical community. However, finding temporal cause-effect relations between those diseases constitutes a big challenge. Clinical records can be used to extract the required information, but their analysis is elusive due to the vast and heterogeneous amount of data. We propose a new methodology for time-preserving tensor networks decomposition to be applied in the analysis of big data problems where the temporal dimension of the key factual fields must not be modified. This methodology will also allow the creation of a new process mining modeling which can capture the cause-effect relations as low-order tensors associated to the transitions of the mined processes, and whose structure takes the form of a multilayer complex network. All these theoretical and methodological advances will allow their application to real biomedical data to analyze comorbidities.

[1]  David J. Schwab,et al.  Supervised Learning with Tensor Networks , 2016, NIPS.

[2]  Eugene E. Tyrtyshnikov,et al.  Breaking the Curse of Dimensionality, Or How to Use SVD in Many Dimensions , 2009, SIAM J. Sci. Comput..

[3]  Fran Casino,et al.  Trends and challenges in smart healthcare research: A journey from data to wisdom , 2017, 2017 IEEE 3rd International Forum on Research and Technologies for Society and Industry (RTSI).

[4]  H. Quan,et al.  Coding Algorithms for Defining Comorbidities in ICD-9-CM and ICD-10 Administrative Data , 2005, Medical care.

[5]  Alexander L. Wolf,et al.  Discovering models of software processes from event-based data , 1998, TSEM.

[6]  Nikos D. Sidiropoulos,et al.  Tensor Decomposition for Signal Processing and Machine Learning , 2016, IEEE Transactions on Signal Processing.

[7]  Anindya Datta,et al.  Automating the Discovery of AS-IS Business Process Models: Probabilistic and Algorithmic Approaches , 1998, Inf. Syst. Res..

[8]  Dimitrios Gunopulos,et al.  Mining Process Models from Workflow Logs , 1998, EDBT.

[9]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[10]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[11]  F. L. Hitchcock The Expression of a Tensor or a Polyadic as a Sum of Products , 1927 .

[12]  A. Arenas,et al.  Mathematical Formulation of Multilayer Networks , 2013, 1307.4977.

[13]  Vito Latora,et al.  Structural reducibility of multilayer networks , 2015, Nature Communications.

[14]  W. Hackbusch,et al.  A New Scheme for the Tensor Representation , 2009 .

[15]  Joachim Herbst,et al.  A Machine Learning Approach to Workflow Management , 2000, ECML.

[16]  Lars Grasedyck,et al.  Hierarchical Singular Value Decomposition of Tensors , 2010, SIAM J. Matrix Anal. Appl..

[17]  Andrzej Cichocki,et al.  Tensor Networks for Dimensionality Reduction, Big Data and Deep Learning , 2018, Advances in Data Analysis with Computational Intelligence Methods.

[18]  Roberto Di Pietro,et al.  Smart health: A context-aware health paradigm within smart cities , 2014, IEEE Communications Magazine.

[19]  Tamara G. Kolda,et al.  Scalable Tensor Decompositions for Multi-aspect Data Mining , 2008, 2008 Eighth IEEE International Conference on Data Mining.