Tensor factorization toward precision medicine

Precision medicine initiatives come amid the rapid growth in quantity and variety of biomedical data, which exceeds the capacity of matrix-oriented data representations and many current analysis algorithms. Tensor factorizations extend the matrix view to multiple modalities and support dimensionality reduction methods that identify latent groups of data for meaningful summarization of both features and instances. In this opinion article, we analyze the modest literature on applying tensor factorization to various biomedical fields including genotyping and phenotyping. Based on the cited work including work of our own, we suggest that tensor applications could serve as an effective tool to enable frequent updating of medical knowledge based on the continually growing scientific and clinical evidence. We encourage extensive experimental studies to tackle challenges including design choice of factorizations, integrating temporality and algorithm scalability.

[1]  Peter Szolovits,et al.  Automatic lymphoma classification with sentence subgraph mining from pathology reports. , 2014, Journal of the American Medical Informatics Association : JAMIA.

[2]  Ping Zhang,et al.  Clinical risk prediction with multilinear sparse logistic regression , 2014, KDD.

[3]  Rasmus Bro,et al.  Multiway analysis of epilepsy tensors , 2007, ISMB/ECCB.

[4]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[5]  Peter Szolovits,et al.  Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text , 2015, J. Am. Medical Informatics Assoc..

[6]  O. Alter,et al.  A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms , 2011, PloS one.

[7]  Bülent Yener,et al.  Multiway modeling and analysis in stem cell systems biology , 2008, BMC Systems Biology.

[8]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[9]  Xiaofeng Gong,et al.  Tensor decomposition of EEG signals: A brief review , 2015, Journal of Neuroscience Methods.

[10]  David B. Dunson,et al.  Bayesian Conditional Tensor Factorizations for High-Dimensional Classification , 2013, Journal of the American Statistical Association.

[11]  M. Ng,et al.  MultiFacTV: module detection from higher-order time series biological data , 2013, BMC Genomics.

[12]  Haifeng Li,et al.  Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation , 2011, PLoS Comput. Biol..

[13]  Andrzej Cichocki,et al.  Nonnegative Tensor Factorization for Continuous EEG Classification , 2007, Int. J. Neural Syst..

[14]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[15]  R. Altman,et al.  Collective judgment predicts disease-associated single nucleotide variants , 2013, BMC Genomics.

[16]  Bülent Yener,et al.  Sublineage structure analysis of Mycobacterium tuberculosis complex strains using multiple-biomarker tensors , 2011, BMC Genomics.

[17]  Orly Alter,et al.  Tensor GSVD of Patient- and Platform-Matched Tumor and Normal DNA Copy-Number Profiles Uncovers Chromosome Arm-Wide Patterns of Tumor-Exclusive Platform-Consistent Alterations Encoding for Cell Transformation and Predicting Ovarian Cancer Survival , 2015, PloS one.

[18]  Jimeng Sun,et al.  Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization , 2014, KDD.

[19]  Alioune Ngom,et al.  Non-negative matrix and tensor factorization based classification of clinical microarray gene expression data , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[20]  Orly Alter,et al.  Tensor Decomposition Reveals Concurrent Evolutionary Convergences and Divergences and Correlations with Structural Motifs in Ribosomal RNA , 2011, PloS one.

[21]  Lars Kai Hansen,et al.  Parallel Factor Analysis as an exploratory tool for wavelet transformed event-related EEG , 2006, NeuroImage.

[22]  Jimeng Sun,et al.  Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics , 2015, KDD.

[23]  Jimeng Sun,et al.  Limestone: High-throughput candidate phenotype generation via tensor factorization , 2014, J. Biomed. Informatics.

[24]  David Dunson,et al.  Bayesian Factorizations of Big Sparse Tensors , 2013, Journal of the American Statistical Association.

[25]  Mark Jenkinson,et al.  Non-local Shape Descriptor: A New Similarity Metric for Deformable Multi-modal Registration , 2011, MICCAI.

[26]  Gene H Golub,et al.  Reconstructing the pathways of a cellular system from genome-scale signals by using matrix and tensor computations. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Ben Taskar,et al.  Regularized Tensor Factorization for Multi-Modality Medical Image Classification , 2011, MICCAI.

[28]  D. Botstein,et al.  Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  G. Golub,et al.  A tensor higher-order singular value decomposition for integrative analysis of DNA microarray data from different studies , 2007, Proceedings of the National Academy of Sciences.

[30]  Andrey A. Puretskiy,et al.  Nonnegative Tensor Factorization of Biomedical Literature for Analysis of Genomic Data , 2014 .

[31]  David B. Dunson,et al.  Scalable Bayesian Low-Rank Decomposition of Incomplete Multiway Tensors , 2014, ICML.

[32]  David B. Dunson,et al.  Learning phenotype densities conditional on many interacting predictors , 2014, Bioinform..

[33]  Shuiwang Ji,et al.  Computational network analysis of the anatomical and genetic organizations in the mouse brain , 2011, Bioinform..

[34]  Katsutoshi Yada Data Mining for Service , 2014 .

[35]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[36]  O. Alter,et al.  Global effects of DNA replication and DNA replication origin activity on eukaryotic gene expression , 2009, Molecular systems biology.