Diagnosis prediction based on similarity of patients physiological parameters

Medical staff can be considerably supported in patient healthcare delivery thanks to the adoption of machine learning and deep learning methods by enhancing clinicians decisions and analysis with targeted clinical knowledge, patient information, and other health data. This paper proposes a learning methodology that, on the basis of the current patient health status, clinical history, diagnostic and laboratory results, provides insights for clinicians in the diagnosis and therapy decision processes. The approach relies on the concept that patients with similar vital signs patterns are, in all probability, affected by the same or very similar health problems. Thus, they can have the same or very similar diagnoses. Patients physiological signals are modeled as time series and the similarity among them is exploited. The method is formulated as a classification problem in which an ad-hoc multi-label k-nearest neighbor approach is combined with similarity concepts based on word embedding. Experimental results on real-world clinical data have shown that the proposed approach allows detecting diagnoses with a precision up to about 75%.

[1]  D. Coomans,et al.  Alternative k-nearest neighbour rules in supervised pattern recognition : Part 1. k-Nearest neighbour classification by using alternative voting rules , 1982 .

[3]  Molly Wilkinson,et al.  Dynamic time warping outperforms Pearson correlation in detecting atypical functional connectivity in autism spectrum disorders , 2020, NeuroImage.

[4]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[5]  Matteo Pagliardini,et al.  Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features , 2017, NAACL.

[6]  Yan Liu,et al.  Benchmarking deep learning models on large healthcare datasets , 2018, J. Biomed. Informatics.

[7]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[8]  Gintautas Dzemyda,et al.  Method for Visual Detection of Similarities in Medical Streaming Data , 2015, Int. J. Comput. Commun. Control.

[9]  Mark Hoogendoorn,et al.  Prediction using patient comparison vs. modeling: A case study for mortality prediction , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[10]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[11]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[12]  Alfredo Pulvirenti,et al.  Similarity Measures and Dimensionality Reduction Techniques for Time Series Data Mining , 2012 .

[13]  Walter F. Stewart,et al.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks , 2015, MLHC.

[14]  Jiang Jiang,et al.  A Bayesian approach for sleep and wake classification based on dynamic time warping method , 2017, Multimedia Tools and Applications.

[15]  Rosalind W. Picard,et al.  Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[16]  Aram Galstyan,et al.  Multitask learning and benchmarking with clinical time series data , 2017, Scientific Data.

[17]  António E. Ruano,et al.  Comparison of different methods of measuring similarity in physiologic time series , 2017 .

[18]  Volkan Tuzcu,et al.  Dynamic time warping as a novel tool in pattern recognition of ECG changes in heart rhythm disturbances , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.