Differentially expressed genes selection via Truncated Nuclear Norm Regularization

Robust Principal Component Analysis (RPCA) is an efficient method in the selection of differentially expressed genes. However, nuclear norm minimizes all singular values simultaneously, so it may not be the best solution to replace the low-rank function. In this paper, the truncated nuclear norm is introduced. And a new method named Truncated nuclear norm regularized Robust Principal Component Analysis (TRPCA) is proposed. The method decomposes the observation matrix of genomic data into a low-rank matrix and a sparse matrix. The differentially expressed genes can be selected according to the sparse matrix. The experimental results on the The Cancer Genome Atlas (TCGA) data illustrate that the TRPCA method outperforms other state-of-the-art methods in the selection of differentially expressed genes.

[1]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[2]  Shiqian Ma,et al.  Fixed point and Bregman iterative methods for matrix rank minimization , 2009, Math. Program..

[3]  Jing-Yu Yang,et al.  Characteristic Gene Selection via Weighting Principal Components by Singular Values , 2012, PloS one.

[4]  Christian C Abnet,et al.  Serum pepsinogens and risk of esophageal squamous dysplasia , 2009, International journal of cancer.

[5]  Xuelong Li,et al.  Fast and Accurate Matrix Completion via Truncated Nuclear Norm Regularization , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Yilun Wang,et al.  Truncated Nuclear Norm Minimization for Image Restoration Based On Iterative Support Detection , 2014, ArXiv.

[7]  Jin Tang,et al.  Graph-Laplacian PCA: Closed-Form Solution and Robustness , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Yong Xu,et al.  Robust PCA based method for discovering differentially expressed genes , 2013, BMC Bioinformatics.

[9]  Wei Guo,et al.  JAK/STAT signal pathway activation promotes progression and survival of human oesophageal squamous cell carcinoma , 2012, Clinical and Translational Oncology.

[10]  G. Sapiro,et al.  A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[11]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[12]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[13]  Junbin Gao,et al.  Laplacian Regularized Low-Rank Representation and Its Applications , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[15]  Yong Xu,et al.  Extracting plants core genes responding to abiotic stresses by penalized matrix decomposition , 2012, Comput. Biol. Medicine.

[16]  Nayoung Kim,et al.  The role of serum pepsinogen in the detection of gastric cancer. , 2010, Gut and liver.

[17]  Samuel B. Williams,et al.  ASSOCIATION FOR COMPUTING MACHINERY , 2000 .