Multi-task Sparse Metric Learning for Monitoring Patient Similarity Progression

A clinically meaningful distance metric, which is learned from measuring patient similarity, plays an important role in clinical decision support applications. Several metric learning approaches have been proposed to measure patient similarity, but they are mostly designed for learning the metric at only one time point/interval. It leads to a problem that those approaches cannot reflect the similarity variations among patients with the progression of diseases. In order to capture similarity information from multiple future time points simultaneously, we formulate a multi-task metric learning approach to identify patient similarity. However, it is challenging to directly apply traditional multi-task metric learning methods to learn such similarities due to the high dimensional, complex and noisy nature of healthcare data. Besides, the disease labels often have clinical relationships, which should not be treated as independent. Unfortunately, traditional formulation of the loss function ignores the degree of labels' similarity. To tackle the aforementioned challenges, we propose mtTSML, a multi-task triplet constrained sparse metric learning method, to monitor the similarity progression of patient pairs. In the proposed model, the distance for each task can be regarded as the combination of a common part and a task-specific one in the transformed low-rank space. We then perform sparse feature selection for each individual task to select the most discriminative information. Moreover, we use triplet constraints to guarantee the margin between similar and less similar pairs according to the ordered information of disease severity levels (i.e. labels). The experimental results on two real-world healthcare datasets show that the proposed multi-task metric learning method significantly outperforms the state-of-the-art baselines, including both single-task and multi-task metric learning methods.

[1]  Jimeng Sun,et al.  Integrating Distance Metrics Learned from Multiple Experts and its Application in Inter-Patient Similarity Assessment , 2011, SDM.

[2]  Yuan Shi,et al.  Sparse Compositional Metric Learning , 2014, AAAI.

[3]  C. Cooper,et al.  Fortnightly Review: Bone densitometry in clinical practice , 1995, BMJ.

[4]  Fenglong Ma,et al.  MuVAN: A Multi-view Attention Network for Multivariate Temporal Data , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[5]  Kaizhu Huang,et al.  Geometry Preserving Multi-task Metric Learning , 2012, ECML/PKDD.

[6]  Frédéric Jurie,et al.  PCCA: A new approach for distance learning from sparse pairwise constraints , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Jiayu Zhou,et al.  A multi-task learning formulation for predicting disease progression , 2011, KDD.

[8]  Xiang Wang,et al.  Unsupervised learning of disease progression models , 2014, KDD.

[9]  Chenglin Miao,et al.  Uncorrelated Patient Similarity Learning , 2018, SDM.

[10]  Dit-Yan Yeung,et al.  Transfer metric learning by learning task relationships , 2010, KDD.

[11]  Jimeng Sun,et al.  Predicting Patient's Trajectory of Physiological Data using Temporal Trends in Similar Patients: A System for Near-Term Prognostics. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[12]  Jiayu Zhou,et al.  Modeling disease progression via fused sparse group lasso , 2012, KDD.

[13]  Fei Wang,et al.  PSF: A Unified Patient Similarity Evaluation Framework Through Metric Learning With Weak Supervision , 2015, IEEE Journal of Biomedical and Health Informatics.

[14]  Joon Lee,et al.  Personalized Mortality Prediction Driven by Electronic Medical Data and a Patient Similarity Metric , 2015, PloS one.

[15]  Shiyu Chang,et al.  Low-Rank Sparse Feature Selection for Patient Similarity Learning , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[16]  M. Kowalski Sparse regression using mixed norms , 2009 .

[17]  Fenglong Ma,et al.  Risk Prediction on Electronic Health Records with Prior Medical Knowledge , 2018, KDD.

[18]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[19]  Gaurav Sharma,et al.  CP-mtML: Coupled Projection Multi-Task Metric Learning for Large Scale Face Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Marc Sebban,et al.  A Survey on Metric Learning for Feature Vectors and Structured Data , 2013, ArXiv.

[21]  Jianying Hu,et al.  Towards Personalized Medicine: Leveraging Patient Similarity and Drug Similarity Analytics , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[22]  Dacheng Tao,et al.  Person Re-Identification Over Camera Networks Using Multi-Task Distance Metric Learning , 2014, IEEE Transactions on Image Processing.

[23]  Jianping Fan,et al.  Hierarchical learning of multi-task sparse metrics for large-scale image classification , 2017, Pattern Recognit..

[24]  Fenglong Ma,et al.  A Multi-task Framework for Monitoring Health Conditions via Attention-based Recurrent Neural Networks , 2017, AMIA.

[25]  Chenglin Miao,et al.  Metric Learning from Probabilistic Labels , 2018, KDD.

[26]  Springer-Verlag London Limited A multi-task framework for metric learning with common subspace , 2013 .

[27]  Wei Xiao,et al.  Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning , 2017, KDD.

[28]  Fenglong Ma,et al.  Deep Patient Similarity Learning for Personalized Healthcare , 2018, IEEE Transactions on NanoBioscience.

[29]  Suvrit Sra,et al.  Geometric Mean Metric Learning , 2016, ICML.

[30]  Fei Wang,et al.  Measuring Patient Similarities via a Deep Architecture with Medical Concept Embedding , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[31]  R. Sharan,et al.  A method for inferring medical diagnoses from patient similarities , 2013, BMC Medicine.

[32]  Fei Wang,et al.  Supervised patient similarity measure of heterogeneous patient records , 2012, SKDD.

[33]  Volker Roth,et al.  A Complete Analysis of the l_1, p Group-Lasso , 2012, ICML.

[34]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[35]  Fei Wang,et al.  An RNN Architecture with Dynamic Temporal Matching for Personalized Predictions of Parkinson's Disease , 2017, SDM.

[36]  Jieping Ye,et al.  A Multi-Task Learning Formulation for Survival Analysis , 2016, KDD.

[37]  Dan Ye,et al.  Fine-grained Patient Similarity Measuring using Deep Metric Learning , 2017, CIKM.

[38]  Kebin Jia,et al.  Wave2Vec: Learning Deep Representations for Biosignals , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[39]  Jiayu Zhou,et al.  Multi-Task Learning based Survival Analysis for Predicting Alzheimer's Disease Progression with Multi-Source Block-wise Missing Data , 2018, SDM.

[40]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[41]  Kilian Q. Weinberger,et al.  Large Margin Multi-Task Metric Learning , 2010, NIPS.